NUCLEOTIDE SEQUENCES FOR GENE REGULATION
AND METHODS OF USE THEREOF



This work was made, in part, with Government support by the National Institutes of
Health grant numbers HL38644 and HL53713. The Government has certain rights in this
invention.

FIELD OF THE INVENTION 

The invention relates to nucleic acid sequences which regulate expression of a
nucleotide sequence of interest. In particular, the invention relates to nucleic acid sequences
which regulate expression of a nucleotide sequence of interest in an age-related manner
and/or in a liver-specific manner. The invention further relates to methods of using the
regulatory nucleic acid sequences provided herein for age-related and/or liver-specific
expression of nucleotide sequences of interest. The invention also relates to host cells and
to transgenic non-human animals which harbor the regulatory nucleic acid sequences of the
invention. The compositions and methods of the invention are useful in regulating
expression of a nucleotide sequence of interest in an age-related and/or liver-specific manner.

BACKGROUND OF THE INVENTION 

A multitude of human diseases (e.g., thrombosis, cardiovascular diseases, diabetes,
Alzheimer's disease, cancer, osteoporosis, osteoarthritis, Parkinson's disease, dementia) are
associated with increasing age and result in serious effects on the quality of life and on the
life expectancy of individuals suffering from such diseases. Other diseases (e.g., cirrhosis,
primary and metastatic neoplasia, Wilson disease, hemochromatosis, infectious hepatitis,
hepatic necrosis, Gilbert disease, Crigler-Najjar disease) which afflict the liver also have
serious clinical manifestations and are responsible for high morbidity and mortality.

The treatment of age-related diseases (i.e., diseases whose prevalence and/or severity
of clinical manifestations increases with the age of the patient) and diseases afflicting the
liver focuses on the alleviation of the general symptoms of the disease using one or a
combination of two modalities, i.e., non-pharmacological treatment and pharmacological
treatment. Non-pharmacological treatment includes, for example, periods of bed rest and
dietary changes. Non-pharmacological treatment is often used as an adjunct to
pharmacological treatment, which involves the use of drugs. Unfortunately, many of the
commonly used pharmacological agents have numerous side effects, and their use is further
complicated by the non-responsiveness of many patients with severe disease, who,
paradoxically, are most in need of treatment. Both non-pharmacological and
pharmacological treatments provide unsatisfactory approaches to treating age-related and
liver-associated diseases because these approaches are often ineffective, their effects are
inconsistent, and they are directed to alleviating the general symptoms of disease, rather than
to specifically addressing the source of morbidity and mortality. Moreover, no suitable animal
models are currently available to rationally design drugs which target specific biochemical
and physiological pathways which are associated with age-related and with liver-associated
diseases.

What is needed are methods for age-related and liver-specific gene expression and
models for age-related and liver-specific diseases.

SUMMARY OF THE INVENTION

The invention provides nucleic acid sequences which regulate expression of a
nucleotide sequence of interest in an age-related manner, as well as nucleic acid sequences
which direct liver-specific expression of a gene of interest. Further provided by the
invention are transgenic animals which may be used as models for age-related and/or
liver-specific diseases.

In one embodiment, the invention provides a recombinant expression vector
comprising in operable combination i) a nucleic acid sequence of interest, ii) a promoter
sequence, and iii) one or more age regulatory sequences selected from SEQ ID NO:1, SEQ
ID NO:3, a portion of SEQ ID NO:1, and a portion of SEQ ID NO:3. Without intending to
limit the invention to any particular type or source of nucleic acid sequence of interest, in a
preferred embodiment, the nucleic acid sequence of interest encodes a protein selected from
factor VIII, factor VII, factor IX, factor X, prothrombin, protein C, antithrombin III, tissue
factor pathway inhibitor, LDL-receptor, human α1-antitrypsin, antithrombin III, fibrinolytic
pathway factors and inhibitors, PEA-3 protein, PEA-3 related proteins including Ets family
transcriptional factors, β-galactosidase, and luciferase. While it is not intended that the
invention be restricted to any particular type or source of promoter sequence, in an
alternative preferred embodiment, the promoter sequence is selected from human factor IX
promoter, cytomegalovirus promoter, tRNA promoter, 5S rRNA promoters, histone gene
promoters, RSV promoter, retrovirus LTR promoter, SV40 promoter, PEPCK promoter, MT
promoter, SRα promoter, P450 family promoters, GAL7 promoter, T7 promoter, T3
promoter, SP6 promoter, K11 promoter, and HIV promoter. It is not contemplated that the
invention be limited to any particular age regulatory sequence which is a portion of SEQ ID
NO:1. However, in another preferred embodiment, the age regulatory sequence which is a
portion of SEQ ID NO:1 is selected from SEQ ID NO:2, SEQ ID NO:33, SEQ ID NO:34,
SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, and SEQ ID NO:38. Without intending to
limit the invention to any particular age regulatory sequence which is a portion of SEQ ID
NO:3, in yet another preferred embodiment, the age regulatory sequence which is a portion
of SEQ ID NO:3 is selected from SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID
NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59,
SEQ ID NO:60, and SEQ ID NO:61.

Also provided by the invention is a host cell containing a recombinant expression
vector comprising in operable combination i) a nucleic acid sequence of interest, ii) a
promoter sequence, and iii) one or more age regulatory sequences selected from SEQ ID
NO:1, SEQ ID NO:3, a portion of SEQ ID NO:1, and a portion of SEQ ID NO:3. Without
intending to limit the invention to the environment in which the host cell is contained, in one
preferred embodiment, the host cell is comprised in a tissue or organ in a living animal. In
another preferred embodiment, the host cell is a gamete. In yet another preferred
embodiment, the host cell is selected from bacterial cell, yeast cell, plant cell, insect cell, and
mammalian cell.

The invention also provides a recombinant expression vector comprising in operable
combination i) a nucleic acid sequence of interest, ii) a promoter sequence, and iii) a
functional homolog of one or more age regulatory sequences selected from SEQ ID NO:1,
SEQ ID NO:3, a portion of SEQ ID NO:1, and a portion of SEQ ID NO:3. Without
limiting the invention to the type or source of the nucleic acid sequence of interest, in one
preferred embodiment, the nucleic acid sequence of interest encodes a protein selected from
factor VIII, factor VII, factor IX, factor X, prothrombin, protein C, antithrombin III,
fibrinolytic pathway factors and inhibitors, PEA-3 protein, PEA-3 related proteins including
Ets family transcriptional factors, β-galactosidase, and luciferase. While it is not intended
that the invention be limited to the type or source of the promoter sequence, in an alternative
preferred embodiment, the promoter sequence is selected from human factor IX promoter,
cytomegalovirus promoter, tRNA promoter, 5S rRNA promoters, histone gene promoters,
RSV promoter, retrovirus LTR promoter, SV40 promoter, PEPCK promoter, MT promoter,
SRα promoter, P450 family promoters, GAL7 promoter, T7 promoter, T3 promoter, SP6
promoter, K11 promoter, and HIV promoter. Though it is not contemplated that the
invention be limited to the portion of SEQ ID NO:1 which has age-related regulatory
activity, in another preferred embodiment, the age regulatory sequence which is a portion of
SEQ ID NO:1 is selected from SEQ ID NO:2, SEQ ID NO:33, SEQ ID NO:34, SEQ ID
NO:35, SEQ ID NO:36, SEQ ID NO:37, and SEQ ID NO:38. Without intending to limit
the invention to the portion of SEQ ID NO:3 which has age-related regulatory activity, in yet
another preferred embodiment, the age regulatory sequence which is a portion of SEQ ID
NO:3 is selected from SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID
NO:60, and SEQ ID NO:61.

Also provided herein is a host cell containing a recombinant expression vector
comprising in operable combination i) a nucleic acid sequence of interest, ii) a promoter
sequence, and iii) a functional homolog of one or more age regulatory sequences selected
from SEQ ID NO:1, SEQ ID NO:3, a portion of SEQ ID NO:1, and a portion of SEQ ID
NO:3. Without intending to limit the invention to the environment in which the host cell is
contained, in one preferred embodiment, the host cell is comprised in a tissue or organ in a
living animal. In an alternative preferred embodiment, the host cell is a gamete. In another
preferred embodiment, the host cell is selected from bacterial cell, yeast cell, plant cell,
insect cell, and mammalian cell.

The invention also provides a method, comprising: a) providing: i) a cell, ii) a nucleic
acid sequence of interest, iii) a promoter sequence, and iv) one or more age regulatory
sequences selected from SEQ ID NO:1, SEQ ID NO:3, a portion of SEQ ID NO:1, and a
portion of SEQ ID NO:3; b) operably linking the nucleic acid sequence of interest, the
promoter sequence, and the one or more age regulatory sequences to produce a transgene;
and c) introducing the transgene into the cell to create a treated cell under conditions such
that the nucleic acid sequence of interest is expressed in the treated cell. Without intending
to limit the treated cell to any particular environment, in one preferred embodiment, the
treated cell is comprised in a tissue or organ in a living animal.

The invention further provides a substantially purified nucleic acid sequence
comprising a nucleotide sequence selected from a functional homolog of SEQ ID NO:1 and
of the complement thereof.

Also provided herein is a substantially purified nucleic acid sequence comprising a
nucleotide sequence selected from a functional homolog of SEQ ID NO:3 and of the
complement thereof.

Also provided by the present invention is a substantially purified nucleic acid
sequence comprising a portion of a nucleotide sequence selected from a functional homolog
of SEQ ID NO:1 and of the complement thereof. In one embodiment, the portion is SEQ
ID NO:2. In an alternative embodiment, the portion is selected from SEQ ID NO:33, SEQ
ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, and SEQ ID NO:38.

The invention also provides a substantially purified nucleic acid sequence comprising
a portion of a nucleotide sequence selected from a functional homolog of SEQ ID NO:3 and
of the complement thereof. In one embodiment, the portion is selected from SEQ ID
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56,
SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, and SEQ ID NO:61.

Also provided herein is a substantially purified nucleic acid sequence which
hybridizes under stringent hybridization conditions with SEQ ID NO:1 or with the
complement thereof, wherein the nucleic acid sequence is characterized by having age-related
regulatory activity, and by having greater than 63% and less than 100% homology to
SEQ ID NO:1.

The invention also provides a substantially purified nucleic acid sequence which
hybridizes under stringent hybridization conditions with SEQ ID NO:3 or with the
complement thereof, wherein the nucleic acid sequence is characterized by having age-related
regulatory activity, and by having greater than 60% and less than 100% homology to
SEQ ID NO:3.
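
The homology windows stated in the two preceding paragraphs (greater than 63% for SEQ ID NO:1, greater than 60% for SEQ ID NO:3, and less than 100% in both cases) can be illustrated with a short Python sketch. This passage does not say how homology is computed, so the sketch assumes, purely for illustration, that homology means percent identity over two sequences that have already been aligned to equal length. The function and dictionary names are hypothetical, and the sketch models neither hybridization under stringent conditions nor age-related regulatory activity.

```python
# Lower bounds (percent homology) quoted in the text; the dictionary itself
# is an illustrative construct, not part of the specification.
HOMOLOGY_LOWER_BOUND = {"SEQ ID NO:1": 63.0, "SEQ ID NO:3": 60.0}


def percent_identity(candidate: str, reference: str) -> float:
    """Percent of aligned positions where the two sequences carry the same base.

    Assumes the sequences were aligned beforehand, so they have equal length
    and may contain '-' gap characters, which never count as matches.
    """
    if not reference or len(candidate) != len(reference):
        raise ValueError("sequences must be non-empty and aligned to equal length")
    pairs = zip(candidate.upper(), reference.upper())
    matches = sum(1 for c, r in pairs if c == r and c != "-")
    return 100.0 * matches / len(reference)


def within_homology_window(candidate: str, reference: str, reference_id: str) -> bool:
    """True if identity is greater than the stated lower bound and less than 100%."""
    lower = HOMOLOGY_LOWER_BOUND[reference_id]
    return lower < percent_identity(candidate, reference) < 100.0
```

Both bounds are strict, so under this assumed measure a sequence identical to the reference falls outside the window, as does a sequence sitting exactly at the lower bound.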

The invention additionally provides a recombinant expression vector comprising at
least a portion of a nucleotide sequence selected from a functional homolog of SEQ ID NO:1
and of the complement thereof.

Also provided herein is a recombinant expression vector comprising at least a portion
of a nucleotide sequence selected from a functional homolog of SEQ ID NO:3 and of the
complement thereof.

The invention also provides a transgenic cell comprising at least a portion of a
nucleotide sequence selected from a functional homolog of SEQ ID NO:1 and of the
complement thereof. In one embodiment, the nucleotide sequence is operably linked to a
promoter and to a nucleic acid sequence of interest. In a preferred embodiment, the
transgenic cell is comprised in an animal. In a more preferred embodiment, the nucleic acid
sequence of interest is expressed in an age-related manner in the transgenic cell.

The invention additionally provides a transgenic cell comprising at least a portion of
a nucleotide sequence selected from a functional homolog of SEQ ID NO:3 and of the
complement thereof. In one embodiment, the nucleotide sequence is operably linked to a
promoter and to a nucleic acid sequence of interest. In a preferred embodiment, the
transgenic cell is comprised in an animal. In a more preferred embodiment, the nucleic acid
sequence of interest is expressed in an age-related manner in the transgenic cell.

The invention also provides a method for expressing a nucleic acid sequence of
interest in a cell, comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest;
iii) a promoter sequence; iv) SEQ ID NO:1; and v) SEQ ID NO:3; b) operably linking the
nucleic acid sequence of interest, the promoter sequence, the SEQ ID NO:1 and the SEQ ID
NO:3 to produce a transgene; and c) introducing the transgene into the cell to produce a
transgenic cell under conditions such that the nucleic acid sequence of interest is expressed
in the transgenic cell. In one embodiment, the cell expresses a recombinant protein
identified as SEQ ID NO:47. In an alternative embodiment, the cell is selected from HepG2
cell, fibroblast cell, myoblast cell, and endothelial cell. In another embodiment, the cell is a
fertilized egg cell, and the transgenic cell is a transgenic fertilized egg cell. In a preferred
embodiment, the method further comprises d) introducing the transgenic fertilized egg cell
into a non-human animal and permitting the animal to deliver progeny containing the
transgene. In a more preferred embodiment, the progeny is characterized by age-related
expression of the nucleic acid sequence of interest. In an alternative more preferred
embodiment, the progeny is characterized by liver-specific expression of the nucleic acid
sequence of interest. In another preferred embodiment, the fertilized egg cell is derived from
a mammal of the order Rodentia. In a more preferred embodiment, the fertilized egg cell is
a mouse fertilized egg cell. In yet another embodiment, the promoter is selected from
human factor IX promoter, cytomegalovirus promoter, tRNA promoter, 5S rRNA promoters,
histone gene promoters, RSV promoter, retrovirus LTR promoter, SV40 promoter, PEPCK
promoter, MT promoter, SRα promoter, P450 family promoters, GAL7 promoter, T7
promoter, T3 promoter, SP6 promoter, K11 promoter, and HIV promoter. In a further
embodiment, the nucleic acid sequence of interest encodes a protein selected from factor
VIII, factor VII, factor IX, factor X, prothrombin, protein C, antithrombin III, fibrinolytic
pathway factors and inhibitors, PEA-3 protein, PEA-3 related proteins including Ets family
transcriptional factors, β-galactosidase, and luciferase.

The invention also provides a method for expressing a nucleic acid sequence of
interest in a cell, comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest;
iii) a promoter sequence; iv) a portion of SEQ ID NO:1; and v) a portion of SEQ ID NO:3;
b) operably linking the nucleic acid sequence of interest, the promoter sequence, the portion
of SEQ ID NO:1 and the portion of SEQ ID NO:3 to produce a transgene; and c)
introducing the transgene into the cell to produce a transgenic cell under conditions such that
the nucleic acid sequence of interest is expressed in the transgenic cell.

Additionally provided by the invention is a method for expressing a nucleic acid
sequence of interest in a cell, comprising: a) providing: i) a cell; ii) a nucleic acid sequence
of interest; iii) a promoter sequence; and iv) SEQ ID NO:1; b) operably linking the nucleic
acid sequence of interest, the promoter sequence, and the SEQ ID NO:1 to produce a
transgene; and c) introducing the transgene into the cell to produce a transgenic cell under
conditions such that the nucleic acid sequence of interest is expressed in the transgenic cell.

Also provided herein is a method for expressing a nucleic acid sequence of interest in
a cell, comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest; iii) a
promoter sequence; and iv) a portion of SEQ ID NO:1; b) operably linking the nucleic acid
sequence of interest, the promoter sequence, and the portion of SEQ ID NO:1 to produce a
transgene; and c) introducing the transgene into the cell to produce a transgenic cell under
conditions such that the nucleic acid sequence of interest is expressed in the transgenic cell.

The invention further provides a method for expressing a nucleic acid sequence of
interest in a cell, comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest;
iii) a promoter sequence; and iv) SEQ ID NO:3; b) operably linking the nucleic acid
sequence of interest, the promoter sequence, and the SEQ ID NO:3 to produce a transgene;
and c) introducing the transgene into the cell to produce a transgenic cell under conditions
such that the nucleic acid sequence of interest is expressed in the transgenic cell.

Further provided by the invention is a method for expressing a nucleic acid sequence
of interest in a cell, comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest;
iii) a promoter sequence; and iv) a portion of SEQ ID NO:3; b) operably linking the nucleic
acid sequence of interest, the promoter sequence, and the portion of SEQ ID NO:3 to
produce a transgene; and c) introducing the transgene into the cell to produce a transgenic
cell under conditions such that the nucleic acid sequence of interest is expressed in the
transgenic cell.

The invention further provides additional sequences derived from the hFIX gene.
In particular, the invention provides a substantially purified nucleic acid sequence comprising
at least a portion of SEQ ID NO:93. In one embodiment, the portion has age-related
regulatory activity. In an alternative embodiment, the portion is selected from SEQ ID
NO:91, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103,
SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108,
SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113,
SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118,
SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
and SEQ ID NO:144. In a preferred embodiment, the portion is SEQ ID NO:91.

Also provided herein is a substantially purified nucleic acid sequence which
hybridizes under high stringency hybridization conditions with a nucleotide sequence selected
from SEQ ID NO:91, the complement of SEQ ID NO:91, SEQ ID NO:93, and the
complement of SEQ ID NO:93. In one embodiment, the nucleic acid sequence has
age-related regulatory activity.

The invention also provides a substantially purified nucleic acid sequence comprising
a functional homolog of an age-related regulatory sequence selected from SEQ ID NO:93
and portions thereof.

The invention further provides expression vectors containing hFIX sequences. In one
embodiment, the invention provides a recombinant expression vector comprising in operable
combination i) a nucleic acid sequence of interest, ii) a promoter sequence, and iii) an
age-related regulatory sequence selected from SEQ ID NO:93 and portions thereof. In a
preferred embodiment, the nucleic acid sequence of interest encodes a protein selected from
factor VIII, factor VII, factor IX, factor X, prothrombin, protein C, antithrombin III, tissue
factor pathway inhibitor, LDL-receptor, human α1-antitrypsin, antithrombin III, PEA-3
protein, β-galactosidase, and luciferase. In an alternative embodiment, the promoter sequence
is selected from human factor IX promoter, cytomegalovirus promoter, tRNA promoter, 5S
rRNA promoters, histone gene promoters, RSV promoter, retrovirus LTR promoter, SV40
promoter, PEPCK promoter, MT promoter, SRα promoter, P450 family promoters, GAL7
promoter, T7 promoter, T3 promoter, SP6 promoter, K11 promoter, and HIV promoter. In
yet another alternative embodiment, the portion of SEQ ID NO:93 is selected from SEQ ID
NO:91, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98,
SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103,
SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108,
SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113,
SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118,
SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
and SEQ ID NO:144. In a more preferred embodiment, the portion is SEQ ID NO:91. In
another embodiment, the expression vector further comprises in operable combination an
age-related regulatory sequence selected from SEQ ID NO:1 and portions thereof.

The invention further provides cells containing hFIX-derived sequences. In
particular, the invention provides a host cell containing a recombinant expression vector
comprising in operable combination i) a nucleic acid sequence of interest, ii) a promoter
sequence, and iii) an age-related regulatory sequence selected from SEQ ID NO:93 and
portions thereof. In one embodiment, the host cell is comprised in a tissue or organ in a
living animal. In an alternative embodiment, the host cell is a gamete. In another
alternative embodiment, the host cell is selected from bacterial cell, yeast cell, plant cell,
insect cell, and mammalian cell.

The invention additionally provides a recombinant expression vector comprising in
operable combination i) a nucleic acid sequence of interest, ii) a promoter sequence, and iii)
a functional homolog of an age-related regulatory sequence selected from SEQ ID NO:93
and portions thereof. In one embodiment, the nucleic acid sequence of interest encodes a
protein selected from factor VIII, factor VII, factor IX, factor X, prothrombin, protein C,
antithrombin III, tissue factor pathway inhibitor, LDL-receptor, human α1-antitrypsin,
antithrombin III, PEA-3 protein, β-galactosidase, and luciferase. In an alternative
embodiment, the promoter sequence is selected from human factor IX promoter,
cytomegalovirus promoter, tRNA promoter, 5S rRNA promoters, histone gene promoters,
RSV promoter, retrovirus LTR promoter, SV40 promoter, PEPCK promoter, MT promoter,
SRα promoter, P450 family promoters, GAL7 promoter, T7 promoter, T3 promoter, SP6
promoter, K11 promoter, and HIV promoter. In yet another embodiment, the portion of
SEQ ID NO:93 is selected from SEQ ID NO:91, SEQ ID NO:94, SEQ ID NO:95, SEQ ID
NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID
NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID
NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID
NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID
NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID
NO:121, SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID
NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID
NO:141, SEQ ID NO:142, SEQ ID NO:143, and SEQ ID NO:144. In yet a further
embodiment, the portion is SEQ ID NO:91. In another embodiment, the expression vector
further comprises in operable combination an age-related regulatory sequence selected from
SEQ ID NO:1 and portions thereof.

Also provided herein is a host cell containing a recombinant expression vector
comprising in operable combination i) a nucleic acid sequence of interest, ii) a promoter
sequence, and iii) a functional homolog of an age-related regulatory sequence selected from
SEQ ID NO:93 and portions thereof. In one embodiment, the host cell is comprised in a
tissue or organ in a living animal. In another embodiment, the host cell is a gamete. In yet
another embodiment, the host cell is selected from bacterial cell, yeast cell, plant cell, insect
cell, and mammalian cell.

Also provided herein are methods for using hFIX-derived sequences. For example,
the invention provides a method for expressing a nucleic acid sequence of interest,
comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest; iii) a promoter
sequence; and iv) an age-related regulatory sequence selected from SEQ ID NO:93 and
portions thereof; b) operably linking the nucleic acid sequence of interest, the promoter
sequence, and the age-related regulatory sequence to produce a transgene; and c) introducing
the transgene into the cell to create a treated cell under conditions such that the nucleic acid
sequence of interest is expressed in the treated cell. In one embodiment, the treated cell is
comprised in a tissue or organ in a living animal.

The invention also provides a method for expressing a nucleic acid sequence of
interest, comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest; iii) a
promoter sequence; and iv) a functional homolog of an age-related regulatory sequence
selected from SEQ ID NO:93 and portions thereof; b) operably linking the nucleic acid
sequence of interest, the promoter sequence, and the functional homolog to produce a
transgene; and c) introducing the transgene into the cell to create a treated cell under
conditions such that the nucleic acid sequence of interest is expressed in the treated cell.

In addition to sequences from the hFIX gene, the invention provides sequences
derived from the hPC gene. In particular, the invention provides a substantially purified
nucleic acid sequence comprising a nucleotide sequence selected from at least a portion of
SEQ ID NO:85, and at least a portion of SEQ ID NO:92. In one embodiment, the portion
has activity selected from age-related regulatory activity and regulatory activity. In an
alternative embodiment, the portion is selected from SEQ ID NO:88, SEQ ID NO:145, SEQ
ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID
NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID
NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID
NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:87, SEQ ID NO:174, SEQ ID
NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID
NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID
NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID
NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID
NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:89, and SEQ ID
NO:90. In another alternative embodiment, the portion is selected from SEQ ID NO:89 and
SEQ ID NO:90.

Also provided herein is a substantially purified nucleic acid sequence which
hybridizes under stringent hybridization conditions with a nucleotide sequence selected from
SEQ ID NO:92, the complement of SEQ ID NO:92, SEQ ID NO:85, the complement of
SEQ ID NO:85, SEQ ID NO:89, the complement of SEQ ID NO:89, SEQ ID NO:90, and
the complement of SEQ ID NO:90. In one embodiment, the nucleic acid sequence has
activity selected from age-related regulatory activity and regulatory activity.

The invention additionally provides a substantially purified nucleic acid sequence
comprising a functional homolog of a sequence having activity selected from age-related
regulatory activity and regulatory activity, the sequence selected from SEQ ID NO:92, a
portion of SEQ ID NO:92, SEQ ID NO:85, a portion of SEQ ID NO:85, SEQ ID NO:89,
and SEQ ID NO:90.

Also provided by the invention is a recombinant expression vector comprising in
operable combination i) a nucleic acid sequence of interest, ii) a promoter sequence, and iii)
a nucleotide sequence having activity selected from age-related regulatory activity and
regulatory activity, the nucleotide sequence selected from SEQ ID NO:92, a portion of SEQ
ID NO:92, SEQ ID NO:85, a portion of SEQ ID NO:85, SEQ ID NO:89, and SEQ ID
NO:90. In one embodiment, the nucleic acid sequence of interest encodes a protein
selected from factor VIII, factor VII, factor IX, factor
X, prothrombin, protein C, antithrombin III, tissue factor pathway inhibitor, LDL-receptor,
human α1-antitrypsin, antithrombin III, PEA-3 protein, β-galactosidase, and luciferase. In
another embodiment, the promoter sequence is selected from human factor IX promoter,
cytomegalovirus promoter, tRNA promoter, 5S rRNA promoters, histone gene promoters,
RSV promoter, retrovirus LTR promoter, SV40 promoter, PEPCK promoter, MT promoter,
SRα promoter, P450 family promoters, GAL7 promoter, T7 promoter, T3 promoter, SP6
promoter, K11 promoter, and HIV promoter. In yet another embodiment, the portion is
selected from SEQ ID NO:88, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:87, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:192, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:89, and SEQ ID NO:90. In a further embodiment,
the portion is selected from SEQ ID NO:89 and SEQ ID NO:90.

The invention also provides a host cell containing a recombinant expression vector
comprising in operable combination i) a nucleic acid sequence of interest, ii) a promoter
sequence, and iii) a nucleotide sequence having activity selected from age-related regulatory
activity and regulatory activity, the nucleotide sequence selected from SEQ ID NO:92, a
portion of SEQ ID NO:92, SEQ ID NO:85, a portion of SEQ ID NO:85, SEQ ID NO:89,
and SEQ ID NO:90. In one embodiment, the host cell is comprised in a tissue or organ in a
living animal. In another embodiment, the host cell is a gamete. In a further embodiment,
the host cell is selected from bacterial cell, yeast cell, plant cell, insect cell, and mammalian
cell.

The invention also provides a recombinant expression vector comprising in operable
combination i) a nucleic acid sequence of interest, ii) a promoter sequence, and iii) a
functional homolog of a nucleotide sequence having activity selected from age-related
regulatory activity and regulatory activity, the nucleotide sequence selected from SEQ ID
NO:92, a portion of SEQ ID NO:92, SEQ ID NO:85, a portion of SEQ ID NO:85, SEQ ID
NO:89, and SEQ ID NO:90. In one embodiment, the nucleic acid sequence of interest
encodes a protein selected from factor VIII, factor VII, factor IX, factor X, prothrombin,
protein C, antithrombin III, tissue factor pathway inhibitor, LDL-receptor, human
α1-antitrypsin, antithrombin III, PEA-3 protein, β-galactosidase, and luciferase. In another
embodiment, the promoter sequence is selected from human factor IX promoter,
cytomegalovirus promoter, tRNA promoter, 5S rRNA promoters, histone gene promoters,
RSV promoter, retrovirus LTR promoter, SV40 promoter, PEPCK promoter, MT promoter,
SRα promoter, P450 family promoters, GAL7 promoter, T7 promoter, T3 promoter, SP6
promoter, K11 promoter, and HIV promoter. In a further embodiment, the portion is
selected from
SEQ ID NO:88, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:87, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177,
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182,
SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192,
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197,
SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202,
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207,
SEQ ID NO:208, SEQ ID NO:89, and SEQ ID NO:90. In an alternative embodiment, the
portion is selected from SEQ ID NO:89 and SEQ ID NO:90.

Further provided herein is a host cell containing a recombinant expression vector
comprising in operable combination i) a nucleic acid sequence of interest, ii) a promoter
sequence, and iii) a functional homolog of a nucleotide sequence having activity selected
from age-related regulatory activity and regulatory activity, the nucleotide sequence selected
from SEQ ID NO:92, a portion of SEQ ID NO:92, SEQ ID NO:85, a portion of SEQ ID
NO:85, SEQ ID NO:89, and SEQ ID NO:90. In one embodiment, the host cell is comprised
in a tissue or organ in a living animal. In an alternative embodiment, the host cell is a
gamete. In yet another embodiment, the host cell is selected from bacterial cell, yeast cell,
plant cell, insect cell, and mammalian cell.

Also provided herein are methods for using hPC sequences. In particular, the
invention discloses a method for expressing a nucleic acid sequence of interest, comprising:
a) providing: i) a cell; ii) a nucleic acid sequence of interest; iii) a promoter sequence; and
iv) a nucleotide sequence having activity selected from age-related regulatory activity and
regulatory activity, the nucleotide sequence selected from SEQ ID NO:92, a portion of SEQ
ID NO:92, SEQ ID NO:85, a portion of SEQ ID NO:85, SEQ ID NO:89, and SEQ ID
NO:90; b) operably linking the nucleic acid sequence of interest, the promoter sequence, and
the nucleotide sequence to produce a transgene; and c) introducing the transgene into the cell
to create a treated cell under conditions such that the nucleic acid sequence of interest is
expressed in the treated cell. In one embodiment, the treated cell is comprised in a tissue or
organ in a living animal.

Also provided herein is a method for expressing a nucleic acid sequence of interest,
comprising: a) providing: i) a cell; ii) a nucleic acid sequence of interest; iii) a promoter
sequence; and iv) a functional homolog of a nucleotide sequence having activity selected
from age-related regulatory activity and regulatory activity, the nucleotide sequence selected
from SEQ ID NO:92, a portion of SEQ ID NO:92, SEQ ID NO:85, a portion of SEQ ID
NO:85, SEQ ID NO:89, and SEQ ID NO:90; b) operably linking the nucleic acid sequence
of interest, the promoter sequence, and the functional homolog to produce a transgene; and
c) introducing the transgene into the cell to create a treated cell under conditions such that
the nucleic acid sequence of interest is expressed in the treated cell.



BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the structure of eleven exemplary human FIX minigene expression
constructs and relative in vitro transient expression activities (ng hFIX/10^ cells/48 hr).

Figure 2 shows graphs of longitudinal analyses of transgenic mice which carry
-416FIXml (A), -416FIXml/1.4 (B), -590FIXml (C), -679FIXml (D), and -770FIXml (E)
expression vectors and which produce high initial prepubertal, but rapidly decreasing, hFIX
expression levels with age.

Figure 3 shows a Northern blot of human FIX mRNA levels (A) and a gel showing
hFIX transgene DNA levels as determined by multiplex PCR analysis (B) in the livers and
tails of animals carrying -416FIXml.

Figure 4 shows graphs of longitudinal analyses of transgenic mice which carry
-802FIXml (A), -802FIXml/1.4 (B), -2231FIXml (C), -2231FIXml/1.4 (D) and
-416FIXml/AE5' (E) expression vectors and which produce hFIX at stable and increasing
levels with age.

Figure 5 shows a Northern blot of transgenic mice carrying -802FIXml and
-802FIXml/1.4 expression vectors.

Figure 6 shows a gel from an electrophoretic mobility shift assay using mouse liver
nuclear extracts (NEs) from three different age groups and double-stranded
oligonucleotides containing a PEA-3 element nucleotide sequence spanning from nt -797 to
-776 of the hFIX gene (A), and a competition assay for ³²P-labelled double-stranded
oligonucleotides containing the PEA-3 nucleotide sequence (B).

Figure 7 is a Northern blot showing tissue specificity of hFIX expression in
transgenic mice carrying -416FIXml (A) and -802FIXml (B) expression vectors.

Figure 8A-E shows the nucleotide sequence (SEQ ID NO:4) of, and eight amino acid
sequences (SEQ ID NOs:5 to 12) which together form, the human factor IX (GenBank
accession number K02402). The transcription initiation site (nucleotide 1) and the poly-A
addition site (nucleotide 32,757) are identified by solid circles. The solid vertical arrows
indicate the intron-exon splice junctions. The five Alu repetitive sequences have been
underlined, while the 5-base insert in intron A and the AATAAA sequence in exon VIII are
boxed. The cleavage or termination site at the 3' end of the gene (CATTG) is underlined
with a dashed line.

Figure 9 shows the cDNA sequence (SEQ ID NO: 13) (A) and encoded polypeptide 
sequence (SEQ ID NO:47) (B) of mouse PEA-3 (GenBank accession number X63190). 

Figure 10A-D shows the cDNA sequence (SEQ ID NO:42) of the human α1-antitrypsin
gene (GenBank accession number K02212).

Figure 11 shows the DNA sequence (SEQ ID NO:43) of human antithrombin III
(GenBank accession number A06100).

Figure 12 shows the cDNA sequence (A) (SEQ ID NO:49) (GenBank accession
number X02750) and genomic DNA sequence (B) (SEQ ID NO:50) (GenBank accession
number M11228) of human protein C.

Figure 13 (A-E) shows the nucleic acid sequences (SEQ ID NOs:76-83) of exemplary 
homologs of AE3' (SEQ ID NO:3). 

Figure 14 shows the nucleotide sequence (from nt -1462 to nt +1; SEQ ID NO:85)
which is located at the 5'-end of the human protein C gene.

Figure 15 shows the structure of eight exemplary human protein C minigene
expression constructs.

Figure 16 shows the relative in vitro transient expression activities for five exemplary
human protein C minigene expression constructs.

Figure 17 shows graphs of longitudinal analyses of transgenic mice which carry
-1462hPCml (A), -82hPCml (B), and AE5'/-1462hPCml/AE3' (C) expression vectors.

DEFINITIONS

To facilitate understanding of the invention, a number of terms are defined below.

The term "isolated" when used in relation to a nucleic acid, as in "an isolated nucleic
acid sequence" refers to a nucleic acid sequence that is identified and separated from at least
one contaminant nucleic acid with which it is ordinarily associated in its natural state, or
when obtained from its actual source. Isolated nucleic acid is nucleic acid present in a form
or setting that is different from that in which it is found in nature. In contrast, non-isolated
nucleic acids are nucleic acids such as DNA and RNA which are found in the state in which
they exist in nature. For example, a given DNA sequence (e.g., a gene) is found on the host cell
chromosome in proximity to neighboring genes; RNA sequences, such as a specific mRNA
sequence encoding a specific protein, are found in the cell as a mixture with numerous other
mRNAs which encode a multitude of proteins. However, an isolated nucleic acid sequence
comprising SEQ ID NO:1 includes, by way of example, such nucleic acid sequences in cells
which ordinarily contain SEQ ID NO:1 where the nucleic acid sequence is in a chromosomal
or extrachromosomal location different from that of natural cells, or is otherwise flanked by
a different nucleic acid sequence than that found in nature. The isolated nucleic acid
sequence may be present in single-stranded or double-stranded form. When an isolated
nucleic acid sequence is to be utilized to express a protein, the nucleic acid sequence will
contain (at a minimum) at least a portion of the sense or coding strand (i.e., the nucleic acid
sequence may be single-stranded). Alternatively, it may contain both the sense and
anti-sense strands (i.e., the nucleic acid sequence may be double-stranded).

As used herein, the term "purified" refers to molecules, either nucleic or amino acid
sequences, that are removed from their natural environment, isolated or separated. An
"isolated nucleic acid sequence" is therefore a purified nucleic acid sequence. "Substantially
purified" molecules are at least 60% free, preferably at least 75% free, and more preferably
at least 90% free from other components with which they are naturally associated.

The term "recombinant" when made in reference to a DNA sequence refers to a DNA
sequence which is comprised of segments of DNA joined together by means of molecular
biological techniques. The term "recombinant" when made in reference to a polypeptide
sequence refers to a polypeptide sequence which is expressed using a recombinant DNA
sequence.

As used herein, the terms "vector" and "vehicle" are used interchangeably in reference
to nucleic acid molecules that transfer DNA segment(s) from one cell to another.

The term "expression vector" as used herein refers to a recombinant DNA molecule
containing a desired coding sequence and appropriate nucleic acid sequences necessary for
the expression of the operably linked coding sequence in a particular host organism. Nucleic
acid sequences necessary for expression in prokaryotes include a promoter, optionally an
operator sequence, a ribosome binding site and possibly other sequences. Eukaryotic cells
are known to utilize promoters, enhancers, and termination and polyadenylation signals.

The term "transgenic" when used in reference to a cell refers to a cell which contains
a transgene, or whose genome has been altered by the introduction of a transgene. The term
"transgenic" when used in reference to a tissue or animal refers to a tissue or animal,
respectively, which comprises one or more cells that contain a transgene, or whose genome
has been altered by the introduction of a transgene. Transgenic cells, tissues and animals
may be produced by several methods including the introduction of a "transgene" comprising
nucleic acid (usually DNA) into a target cell or integration of the transgene into a
chromosome of a target cell by way of human intervention, such as by the methods
described herein.

A "non-human animal" refers to any animal which is not a human and includes
vertebrates such as rodents, non-human primates, ovines, bovines, ruminants, lagomorphs,
porcines, caprines, equines, canines, felines, aves, etc. Preferred non-human animals are
selected from the order Rodentia. The term "order Rodentia" refers to rodents, i.e., placental
mammals (class Eutheria) which include the family Muridae (e.g., rats and mice), most
preferably mice.

The term "nucleotide sequence of interest" refers to any nucleotide sequence, the
manipulation of which may be deemed desirable for any reason (e.g., treat disease, confer
improved qualities, etc.), by one of ordinary skill in the art. Such nucleotide sequences
include, but are not limited to, coding sequences of structural genes (e.g., reporter genes,
selection marker genes, oncogenes, drug resistance genes, growth factors, etc.), and
non-coding regulatory sequences which do not encode an mRNA or protein product (e.g.,
promoter sequence, polyadenylation sequence, termination sequence, enhancer sequence,
etc.).

As used herein, the terms "complementarity" or "complementary" are used in
reference to nucleotide sequences related by the base-pairing rules. For example, the
sequence 5'-AGT-3' is complementary to the sequence 5'-ACT-3'. Complementarity can be
"partial" or "total." "Partial" complementarity is where one or more nucleic acid bases is not
matched according to the base pairing rules. "Total" or "complete" complementarity between
nucleic acids is where each and every nucleic acid base is matched with another base under
the base pairing rules. The degree of complementarity between nucleic acid strands has
significant effects on the efficiency and strength of hybridization between nucleic acid
strands.

A "complement" of a nucleic acid sequence as used herein refers to a nucleotide
sequence whose nucleic acids show total complementarity to the nucleic acids of the nucleic
acid sequence.
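
By way of illustration only, the base-pairing rules described above can be sketched in a few lines of Python. The function names `complement` and `fraction_matched` are hypothetical and are not part of the specification; the sketch assumes DNA sequences written 5' to 3', strands aligned antiparallel, and equal lengths when two strands are compared.

```python
# Illustrative sketch only: Watson-Crick pairing for DNA written 5'->3'.
# The names below are hypothetical helpers, not terms used in the specification.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(seq: str) -> str:
    """Return the total complement of seq, itself written 5'->3'."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def fraction_matched(a: str, b: str) -> float:
    """Fraction of positions at which strand a pairs with strand b.

    Both strands are written 5'->3' and are aligned antiparallel.
    A value of 1.0 corresponds to "total" complementarity; any lower
    value corresponds to "partial" complementarity.
    """
    if len(a) != len(b):
        raise ValueError("this sketch compares equal-length strands only")
    matched = sum(PAIRS[x] == y for x, y in zip(a.upper(), reversed(b.upper())))
    return matched / len(a)
```

With these assumptions, `complement("AGT")` gives `"ACT"`, which agrees with the example given in the definition, and `fraction_matched("AGT", "ACT")` gives 1.0.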

The term "homology" when used in relation to nucleic acids refers to a degree of
complementarity. There may be partial homology (i.e., partial identity) or complete
homology (i.e., complete identity). A partially complementary sequence is one that at least
partially inhibits a completely complementary sequence from hybridizing to a target nucleic
acid sequence and is referred to using the functional term "substantially homologous." The
inhibition of hybridization of the completely complementary sequence to the target sequence
may be examined using a hybridization assay (Southern or Northern blot, solution
hybridization and the like) under conditions of low stringency. A substantially homologous
sequence or probe (i.e., an oligonucleotide which is capable of hybridizing to another
oligonucleotide of interest) will compete for and inhibit the binding (i.e., the hybridization)
of a completely homologous sequence to a target sequence under conditions of low
stringency. This is not to say that conditions of low stringency are such that non-specific
binding is permitted; low stringency conditions require that the binding of two sequences to
one another be a specific (i.e., selective) interaction. The absence of non-specific binding
may be tested by the use of a second target which lacks even a partial degree of
complementarity (e.g., less than about 30% identity); in the absence of non-specific binding
the probe will not hybridize to the second non-complementary target.

When used in reference to a double-stranded nucleic acid sequence such as a cDNA
or genomic clone, the term "substantially homologous" refers to any probe which can
hybridize to either or both strands of the double-stranded nucleic acid sequence under
conditions of low stringency as described infra.

When used in reference to a single-stranded nucleic acid sequence, the term
"substantially homologous" refers to any probe which can hybridize to the single-stranded
nucleic acid sequence under conditions of low stringency as described infra.

The term "hybridization" as used herein includes "any process by which a strand of
nucleic acid joins with a complementary strand through base pairing." [Coombs J (1994)
Dictionary of Biotechnology, Stockton Press, New York NY]. Hybridization and the strength
of hybridization (i.e., the strength of the association between the nucleic acids) are impacted
by such factors as the degree of complementarity between the nucleic acids, stringency of the
conditions involved, the Tm of the formed hybrid, and the G:C ratio within the nucleic acids.

As used herein, the term "Tm" is used in reference to the "melting temperature." The
melting temperature is the temperature at which a population of double-stranded nucleic acid
molecules becomes half dissociated into single strands. The equation for calculating the Tm
of nucleic acids is well known in the art. As indicated by standard references, a simple
estimate of the Tm value may be calculated by the equation: Tm = 81.5 + 0.41(% G + C),
when a nucleic acid is in aqueous solution at 1 M NaCl [see e.g., Anderson and Young,
Quantitative Filter Hybridization, in Nucleic Acid Hybridization (1985)]. Other references
include more sophisticated computations which take structural as well as sequence
characteristics into account for the calculation of Tm.
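
As a minimal sketch, the simple estimate above can be computed directly from a sequence. The function names `gc_percent` and `estimate_tm` are hypothetical, and the sketch assumes the stated condition (aqueous solution at 1 M NaCl); it does not attempt the more sophisticated computations mentioned above.

```python
# Illustrative sketch only: the simple estimate Tm = 81.5 + 0.41(% G + C).
# Function names are hypothetical; the formula is the one quoted in the text
# and assumes a nucleic acid in aqueous solution at 1 M NaCl.


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in seq (0 to 100)."""
    if not seq:
        raise ValueError("sequence must not be empty")
    s = seq.upper()
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def estimate_tm(seq: str) -> float:
    """Simple estimate of the melting temperature in degrees Celsius."""
    return 81.5 + 0.41 * gc_percent(seq)
```

For a sequence with 50% G + C content, this estimate is 81.5 + 0.41 x 50 = 102.0.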

Low stringency conditions when used in reference to nucleic acid hybridization
comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting of
5X SSPE (43.8 g/l NaCl, 6.9 g/l NaH2PO4·H2O and 1.85 g/l EDTA, pH adjusted to 7.4 with
NaOH), 1% SDS, 5X Denhardt's reagent [50X Denhardt's contains the following per 500
ml: 5 g Ficoll (Type 400, Pharmacia), 5 g BSA (Fraction V; Sigma)] and 100 µg/ml
denatured salmon sperm DNA followed by washing in a solution comprising 0.2X SSPE
and 0.1% SDS at room temperature when a DNA probe of about 100 to about 1000
nucleotides in length is employed.

High stringency conditions when used in reference to nucleic acid hybridization
comprise conditions equivalent to binding or hybridization at 68°C in a solution consisting of
5X SSPE, 1% SDS, 5X Denhardt's reagent and 100 µg/ml denatured salmon sperm DNA
followed by washing in a solution comprising 0.1X SSPE and 0.1% SDS at 68°C when a
probe of about 100 to about 1000 nucleotides in length is employed.

The term "equivalent" when made in reference to a hybridization condition as it
relates to a hybridization condition of interest means that the hybridization condition and the
hybridization condition of interest result in hybridization of nucleic acid sequences which
have the same range of percent (%) homology. For example, if a hybridization condition of
interest results in hybridization of a first nucleic acid sequence with other nucleic acid
sequences that have from 50% to 70% homology to the first nucleic acid sequence, then
another hybridization condition is said to be equivalent to the hybridization condition of
interest if this other hybridization condition also results in hybridization of the first nucleic
acid sequence with other nucleic acid sequences that have from 50% to 70% homology to
the first nucleic acid sequence.
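
The comparison described in this definition can be sketched as follows. This is an illustration under stated assumptions, not a laboratory procedure: each hybridization condition is summarized by the percent homologies of the sequences observed to hybridize under it, and the names `homology_range`, `are_equivalent`, and the `tolerance` parameter are hypothetical.

```python
# Illustrative sketch only: two hybridization conditions are treated as
# "equivalent" when they hybridize sequences spanning the same range of
# percent homology. All names and the tolerance parameter are assumptions.
from typing import Iterable, Tuple


def homology_range(percent_homologies: Iterable[float]) -> Tuple[float, float]:
    """Lowest and highest percent homology among the hybridizing sequences."""
    values = list(percent_homologies)
    if not values:
        raise ValueError("no hybridizing sequences were observed")
    return (min(values), max(values))


def are_equivalent(range_a: Tuple[float, float],
                   range_b: Tuple[float, float],
                   tolerance: float = 0.0) -> bool:
    """True when both conditions give the same homology range."""
    return (abs(range_a[0] - range_b[0]) <= tolerance
            and abs(range_a[1] - range_b[1]) <= tolerance)
```

In the example from the text, a condition of interest and a second condition that both yield the range (50, 70) would be reported as equivalent.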

With respect to nucleic acid hybridization, it is well known in the art that
numerous equivalent conditions may be employed to comprise either low or high stringency
conditions; factors such as the length and nature (DNA, RNA, base composition) of the
probe and nature of the target (DNA, RNA, base composition, present in solution or
immobilized, etc.) and the concentration of the salts and other components (e.g., the presence
or absence of formamide, dextran sulfate, polyethylene glycol) are considered and the
hybridization solution may be varied to generate conditions of either low or high stringency
hybridization different from, but equivalent to, the above-listed conditions.

Those skilled in the art know that whereas higher stringencies may be preferred to
reduce or eliminate non-specific binding of the nucleotide sequence of SEQ ID NOs:1 or 3
with other nucleic acid sequences, lower stringencies may be preferred to detect a larger
number of nucleic acid sequences having different homologies to the nucleotide sequence of
SEQ ID NOs:1 and 3.






WO 00/75279 PCT/US00/15728

As used herein, the terms "regulatory element" and "regulatory sequence"
interchangeably refer to a nucleotide sequence which does not encode RNA or a protein and
which controls some aspect of the expression of nucleic acid sequences. For example, a
promoter is a regulatory element which facilitates the initiation of transcription of an
operably linked coding region. Other regulatory elements are splicing signals,
polyadenylation signals, termination signals, etc. In contrast, the term "regulatory gene"
refers to a DNA sequence which encodes RNA or a protein (e.g., transcription factor) that
controls the expression of other genes.

Regulatory elements may be tissue specific or cell specific. The term "tissue
specific" as it applies to a regulatory element refers to a regulatory element that is capable of
directing selective expression of a nucleotide sequence of interest to a specific type of tissue
(e.g., liver) in the relative absence of expression of the same nucleotide sequence of interest
in a different type of tissue (e.g., lung).

Tissue specificity of a regulatory element may be evaluated by, for example, operably
linking a reporter gene to a promoter sequence (which is not tissue-specific) and to the
regulatory element to generate a reporter construct, introducing the reporter construct into the
genome of an animal such that the reporter construct is integrated into every tissue of the
resulting transgenic animal, and detecting the expression of the reporter gene (e.g., detecting
mRNA, protein, or the activity of a protein encoded by the reporter gene) in different tissues
of the transgenic animal. The detection of a greater level of expression of the reporter gene
in one or more tissues relative to the level of expression of the reporter gene in other tissues
shows that the regulatory element is "specific" for the tissues in which greater levels of
expression are detected. Thus, the term "tissue-specific" (e.g., liver-specific) as used herein
is a relative term that does not require absolute specificity of expression. In other words, the
term "tissue-specific" does not require that one tissue have extremely high levels of
expression and another tissue have no expression. It is sufficient that expression is greater in
one tissue than another. By contrast, "strict" or "absolute" tissue-specific expression is meant
to indicate expression in a single tissue type (e.g., liver) with no detectable expression in
other tissues.
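
The distinction between relative and strict tissue specificity can be illustrated with a short sketch operating on reporter expression levels measured in several tissues. The function name `classify_specificity`, the `detection_limit` parameter, and the returned labels are hypothetical; the measurement units and the choice of detection limit are assumptions left to the assay used.

```python
# Illustrative sketch only: classify reporter expression in one tissue as
# "strict", "relative", or "none" following the definitions in the text.
# Names, labels, and the detection_limit parameter are assumptions.
from typing import Dict


def classify_specificity(levels: Dict[str, float],
                         tissue: str,
                         detection_limit: float = 0.0) -> str:
    """Classify expression in `tissue` against all other measured tissues."""
    target = levels[tissue]
    others = [level for name, level in levels.items() if name != tissue]
    if not others:
        raise ValueError("at least one other tissue is needed for comparison")
    # Strict: detectable in the target tissue, not detectable anywhere else.
    if target > detection_limit and all(o <= detection_limit for o in others):
        return "strict"
    # Relative: expression is greater in the target tissue than in another.
    if any(target > o for o in others):
        return "relative"
    return "none"
```

For example, with levels of 100 in liver, 5 in lung, and 0 in kidney and a detection limit of 1, liver would be classified as "relative" rather than "strict", because expression remains detectable in lung.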

The term "cell type specific" as applied to a regulatory element refers to a regulatory
element which is capable of directing selective expression of a nucleotide sequence of
interest in a specific type of cell in the relative absence of expression of the same nucleotide
sequence of interest in a different type of cell within the same tissue. The term "cell type
specific" when applied to a regulatory element also means a regulatory element capable of
promoting selective expression of a nucleotide sequence of interest in a region within a
single tissue.

Cell type specificity of a regulatory element may be assessed using methods well
known in the art, e.g., immunohistochemical staining and/or Northern blot analysis. Briefly,
for immunohistochemical staining, tissue sections are embedded in paraffin, and paraffin
sections are reacted with a primary antibody which is specific for the polypeptide product
encoded by the nucleotide sequence of interest whose expression is regulated by the
regulatory element. A labeled (e.g., peroxidase conjugated) secondary antibody which is
specific for the primary antibody is allowed to bind to the sectioned tissue and specific
binding detected (e.g., with avidin/biotin) by microscopy. Briefly, for Northern blot
analysis, RNA is isolated from cells and electrophoresed on agarose gels to fractionate the
RNA according to size followed by transfer of the RNA from the gel to a solid support, such
as nitrocellulose or a nylon membrane. The immobilized RNA is then probed with a labeled
oligo-deoxyribonucleotide probe or DNA probe to detect RNA species complementary to the
probe used. Northern blots are a standard tool of molecular biologists.

The term "promoter," "promoter element," or "promoter sequence" as used herein,
refers to a DNA sequence which when ligated to a nucleotide sequence of interest is capable
of controlling the transcription of the nucleotide sequence of interest into mRNA. A
promoter is typically, though not necessarily, located 5' (i.e., upstream) of a nucleotide
sequence of interest whose transcription into mRNA it controls, and provides a site for
specific binding by RNA polymerase and other transcription factors for initiation of
transcription.

Promoters may be constitutive or regulatable. The term "constitutive" when made in
reference to a promoter means that the promoter is capable of directing transcription of an
operably linked nucleic acid sequence in the absence of a stimulus (e.g., heat shock,
chemicals, etc.). In contrast, a "regulatable" promoter is one which is capable of directing a
level of transcription of an operably linked nucleic acid sequence in the presence of a
stimulus (e.g., heat shock, chemicals, etc.) which is different from the level of transcription
of the operably linked nucleic acid sequence in the absence of the stimulus.

The terms "essentially consisting of" and "consisting essentially of" are equivalent
terms, and when in reference to a nucleic acid sequence they are intended to refer to
nucleotide sequences which contain from 50% to 100% of the nucleic acid bases which are
present in the nucleic acid sequence, in which the arrangement of these nucleic acid bases
with respect to each other in the nucleotide sequences is the same as their arrangement in the
nucleic acid sequence, and in which the biological activity of the nucleotide sequences is
from 50% to 100%, more preferably from 75% to 100%, and most preferably from 90% to
100%, of the biological activity of the nucleic acid sequence. To illustrate, the term "a
nucleic acid sequence consisting essentially of SEQ ID NO:1" refers to nucleotide sequences
which contain from 50% to 100% of the nucleic acid bases which are present in SEQ ID
NO:1, in which the arrangement of these nucleic acid bases with respect to each other in the
nucleotide sequences is the same as their arrangement in SEQ ID NO:1, and in which the
nucleotide sequences exhibit from 50% to 100%, more preferably from 75% to 100%, and
most preferably from 90% to 100%, of the age-related regulatory activity, and/or of the
liver-specific activity of SEQ ID NO:1.
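
One possible reading of this definition can be sketched in code. The sketch rests on two assumptions that are not stated in the text: that "the same arrangement" means the retained bases appear in the same relative order as in the reference (i.e., the candidate is a subsequence of the reference), and that biological activity has already been measured and is supplied as a fraction of the reference activity. All names and tier labels are hypothetical.

```python
# Illustrative sketch only: one interpretation of "consisting essentially of".
# Assumptions: "same arrangement" is modeled as a subsequence test, and
# relative_activity is a measured fraction (1.0 = 100% of reference activity).
from typing import Optional


def is_subsequence(candidate: str, reference: str) -> bool:
    """True if candidate's bases occur in reference in the same order."""
    remaining = iter(reference)
    return all(base in remaining for base in candidate)


def activity_tier(relative_activity: float) -> Optional[str]:
    """Map relative activity onto the tiers recited in the definition."""
    if not 0.50 <= relative_activity <= 1.00:
        return None
    if relative_activity >= 0.90:
        return "most preferred"
    if relative_activity >= 0.75:
        return "more preferred"
    return "included"


def consists_essentially_of(candidate: str, reference: str,
                            relative_activity: float) -> bool:
    """Check base content (50% to 100%), arrangement, and activity."""
    if not reference:
        raise ValueError("reference must not be empty")
    fraction = len(candidate) / len(reference)
    return (0.50 <= fraction <= 1.00
            and is_subsequence(candidate, reference)
            and activity_tier(relative_activity) is not None)
```

Under these assumptions, a candidate retaining four of six reference bases in their original order, with 80% of the reference activity, would satisfy the definition at the "more preferred" activity tier.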

A "functional homolog" of a nucleotide sequence which is derived from the hFIX
gene shown in Figure 8 is defined as a nucleic acid sequence which has more than 50%
identity and less than 100% identity with the hFIX-derived nucleotide sequence (i.e., with
the entire, or a portion of the, hFIX sequence of Figure 8), and which has age-related
regulatory activity and/or liver-specific activity. For example, a functional homolog of SEQ
ID NO:33 includes nucleic acid sequences which have more than 50% identity and less than
100% identity with SEQ ID NO:33, and which have age-related regulatory activity and/or
liver-specific activity.

A "functional homolog" of a nucleotide sequence which is derived from the hPC
sequence shown in Figure 14 is defined as a nucleic acid sequence which has more than 50%
identity and less than 100% identity with the hPC-derived nucleotide sequence (i.e., with the
entire, or a portion of the, hPC sequence of Figure 14), and which has age-related regulatory
activity and/or regulatory activity. For example, a functional homolog of SEQ ID NO:93
includes nucleic acid sequences which have more than 50% identity and less than 100%
identity with SEQ ID NO:93, and which have age-related regulatory activity and/or
regulatory activity.
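
The identity window in these definitions (more than 50% and less than 100%) can be illustrated with a short sketch. The text does not specify how percent identity is computed, so the sketch assumes two sequences that have already been aligned to equal length and counts matching positions; the names `percent_identity` and `is_functional_homolog` are hypothetical, and the presence of regulatory activity is supplied as an experimentally determined flag.

```python
# Illustrative sketch only: the >50% and <100% identity window for a
# "functional homolog". Assumes pre-aligned, equal-length sequences; the
# alignment method and the activity assay are outside this sketch.


def percent_identity(a: str, b: str) -> float:
    """Percent of aligned positions at which a and b carry the same base."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    same = sum(x == y for x, y in zip(a.upper(), b.upper()))
    return 100.0 * same / len(a)


def is_functional_homolog(candidate: str, reference: str,
                          has_regulatory_activity: bool) -> bool:
    """True when identity is within (50, 100) and activity was observed."""
    identity = percent_identity(candidate, reference)
    return has_regulatory_activity and 50.0 < identity < 100.0
```

Note that a sequence identical to the reference (100% identity) is excluded by the definition, as is any sequence lacking the recited activity.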

DESCRIPTION OF THE INVENTION

The invention provides nucleic acid sequences which regulate expression of a
nucleotide sequence of interest. In one embodiment, the invention provides nucleic acid
sequences which regulate expression of a nucleotide sequence of interest in an age-related
manner. More particularly, the exemplary age-regulatory element 5' (AE5') has been
discovered to regulate stable gene expression over time in vivo, while the exemplary
age-regulatory element 3' (AE3') has been discovered to regulate increased gene expression over
time in vivo. In another embodiment, the invention provides nucleic acid sequences (e.g.,
AE5') which direct liver-specific expression of a gene of interest. In yet another
embodiment, the invention provides transgenic animals which harbor the nucleic acid
sequences provided herein and which express a nucleotide sequence of interest in an
age-related and/or liver-specific manner. The nucleic acid sequences provided herein are useful
in, for example, identifying and isolating functional homologs of AE5' and AE3', and
amplifying at least a portion of AE5' and AE3'. Importantly, the nucleic acid sequences of
the invention are also useful in age-related expression and/or liver-specific expression of a
nucleotide sequence of interest in an animal, in gene therapy, and in reducing expression of
factor IX in an animal.

In addition to regulatory sequences which are derived from the human factor IX gene
(e.g., AE5', AE3', and AE3''), the invention further provides sequences which are derived
from the human protein C (hPC) gene and which are characterized by age-related regulatory
activity and regulatory activity.

The invention is further discussed under (A) Regulatory Nucleic Acid Sequences, (B)
Using Probes To Identify And Isolate Homologs Of AE5', AE3', and Of hPC-Derived
Regulatory Sequences, (C) Using Primers To Amplify At Least A Portion Of AE5', AE3',
and Of hPC-Derived Regulatory Sequences, (D) Methods For Regulating Gene Expression,
(E) Gene Therapy, and (F) Reducing Expression Of Factor IX In An Animal.

A. Regulatory Nucleic Acid Sequences

The invention provides regulatory sequences which are derived from the hFIX and
hPC genes.

i. Regulatory Nucleic Acid Sequences From The hFIX Gene

The regulatory nucleic acid sequences of the invention and their surprising properties
in regulating gene expression were discovered during the inventors' investigation of the
mechanisms underlying age-associated regulation of the human factor IX, which is involved
in blood coagulation. Blood coagulation plays a critical role not only in homeostasis, but
also in many physiological and pathological conditions [Saito in Disorders of Hemostasis,
O.D. Ratnoff and C.D. Forbes, Eds., Saunders, Philadelphia, ed. 2 (1991), pp. 18-47; Kurachi
et al. (1993) Blood Coagul. Fibrinol. 4:953-974]. Blood coagulation potential in humans as
well as in other mammals reaches the young adult level around the age of weaning [Yao et
al. (1991) Thromb. Haemost. 65:52-58; Andrew et al. (1992) Blood 80:1998-2005; Andrew
et al. (1987) Blood 70:165-172; Andrew et al. (1988) Blood 72:1651-1657]. This is
followed by a gradual increase in coagulation potential during young adulthood, and an
almost two-fold increase by old age [Sweeney and Hoernig (1993) Am. J. Clin. Pathol.
99:687-688; Kurachi et al. (1996) Thromb. Haemost. 76:965-969]. This age-associated
increase in coagulation potential takes place in healthy centenarians [Marie et al. (1995)
Blood 85:3144-3149], indicating that the increase is a normal phenomenon associated with
aging.

It is the inventors' consideration that this increase in coagulation potential may make
a crucial contribution to the development and progression of age-associated diseases such as
cardiovascular and thrombotic disorders [Conlan et al. (1993) The Atherosclerosis Risk in
Communities (ARIC) Study 70:380-385; Balleisen et al. (1985) Thromb. Haemost. 54:475-479;
Rode et al. (1996) Nat. Med. 2:293-298; Woodward et al. (1997) Brit. J. Haemat.
97:785-797]. The inventors' consideration was based on the observation that this increase in
blood coagulation potential coincides with plasma level increases of pro-coagulant factors
such as factor IX, whereas plasma levels of anti-coagulation factors (such as antithrombin III
and protein C) or of factors involved in fibrinolysis are only marginally affected [Conlan et
al. (1994) The Atherosclerosis Risk in Communities (ARIC) Study 72:551-556; Lowe et al.
(1997) Brit. J. Haemat. 97:775-784]. These facts strongly suggested to the inventors that the
observed increase in blood coagulation activity with advancing age is due to regulated
events. Plasma levels of each protein factor involved in blood coagulation, fibrinolysis and
their regulatory systems are presumably determined by the balance of the many processes
involved. At present, little is known about why an advancing age-associated increase in
blood coagulation activity exists, or what molecular mechanisms are involved in
age-dependent regulation (homeostasis) of blood coagulation [Finch in Longevity, Senescence,
and the Genome, The University of Chicago Press, Chicago, 1990].

Blood coagulation factor IX (FIX) occupies a key position in the blood coagulation
cascade where the intrinsic and extrinsic pathways merge [Saito in Disorders of Hemostasis,
O.D. Ratnoff and C.D. Forbes, Eds., Saunders, Philadelphia, ed. 2 (1991), pp. 18-47; Kurachi
et al. (1993) Blood Coagul. Fibrinol. 4:953-974]. FIX is synthesized in the liver with strict
tissue-specificity, and its deficiency results in the bleeding disorder hemophilia B. In normal
humans the plasma activity and protein concentration levels of human FIX (hFIX) increase
with advancing age [Sweeney and Hoernig (1993) Am. J. Clin. Pathol. 99:687-688; Kurachi
et al. (1996) Thromb. Haemost. 76:965-969]. Mouse FIX (mFIX) plasma activity also
increases with age in a manner similar to hFIX, and is directly correlated with an increase in
liver mFIX messenger RNA (mRNA) level [Sweeney and Hoernig (1993) Am. J. Clin.
Pathol. 99:687-688; Kurachi et al. (1996) Thromb. Haemost. 76:965-969]. However,
nothing else is known about the molecular mechanisms underlying such an increase. In
investigating the basic molecular mechanisms responsible for age-associated regulation of
hFIX, the inventors discovered the nucleotide sequences which regulate age-associated
expression, and which direct liver-specific expression, of the exemplary hFIX gene.

The discovery of the invention sequences was made possible, in part, by the
inventors' use of the hFIX promoter in combination with the coding sequence for hFIX
instead of with the coding sequence for commonly used reporter proteins. The discovery of
the surprising functions of the nucleotide sequences provided herein was also made possible
by the inventors' use of longitudinal in vivo analyses, rather than of in vitro analyses. In
particular, the inventors' earlier studies used reporter genes (including bacterial
β-galactosidase and chloramphenicol acetyltransferase [CAT]) which are heterologous to the
factor IX promoter. In these earlier studies, the factor IX promoter showed only very weak
expression activity in vitro [Kurachi et al. (1995) J. Biol. Chem. 270:5276-5281]. Use of
such heterologous reporter genes made it impossible to reliably and quantitatively perform
longitudinal analyses of transgene expression in animals. The inventors unexpectedly
observed that hFIX minigene expression vectors which contained the hFIX promoter and its
homologous hFIX gene were capable of producing high levels of plasma hFIX in vivo. This
unexpected observation not only solved the problems associated with the use of genes which
are heterologous to the hFIX promoter by providing a reliable animal assay system, but also
provided multiple unexpected critical insights into the regulatory mechanisms of the hFIX
gene, including the determination of nucleotide sequences which regulate the stability and
age-related increased expression of the exemplary hFIX gene.

The present invention provides the 32-nucleotide nucleic acid sequence 5'-agccatt
cagtcgagga aggatagggt ggtat-3' (SEQ ID NO:1) of AE5' which corresponds to the sequence
from 2164 to 2195 of the hFIX gene deposited in GenBank as accession number K02402,
and which corresponds to the sequence from -802 to -771 of Figure 8 when in relation to the
hFIX start codon (ATG) in which the adenine is designated as position +30.

The present invention also provides the 1273-nucleotide nucleic acid sequence (SEQ
ID NO:3) (Figure 13) of AE3' which corresponds to the sequence from 34,383 to 35,655 of
GenBank accession number K02402, and which corresponds to the sequence from 31,418 to
32,690 of Figure 8 when in relation to the hFIX start codon (ATG) in which the adenine is
designated as position +30.
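
By way of illustration only, the coordinate pairs recited above for AE5' and AE3' may be cross-checked with a short script. The sketch below is not part of the disclosed methods: the constant offset of 2,965 between GenBank K02402 numbering and Figure 8 numbering, and the absence of a position 0 between -1 and +1, are assumptions inferred from the recited coordinate pairs, and the function names are hypothetical.

```python
# Assumption (inferred from the coordinate pairs recited in the text, not stated
# by the specification): Figure 8 numbering = GenBank K02402 numbering - 2965
# for positive positions, and there is no position 0 between -1 and +1.
OFFSET = 2965


def genbank_to_figure8(pos: int) -> int:
    """Convert a GenBank K02402 position to Figure 8 numbering."""
    rel = pos - OFFSET
    return rel if rel >= 1 else rel - 1  # skip the unused position 0


def figure8_to_genbank(pos: int) -> int:
    """Convert a Figure 8 position back to GenBank K02402 numbering."""
    if pos == 0:
        raise ValueError("position 0 is not used in this numbering scheme")
    return pos + OFFSET if pos > 0 else pos + OFFSET + 1


def span_length(start: int, end: int) -> int:
    """Number of nucleotides in an inclusive span."""
    return end - start + 1


if __name__ == "__main__":
    spans = {"AE5'": (2164, 2195), "AE3'": (34383, 35655)}
    for name, (start, end) in spans.items():
        print(name, genbank_to_figure8(start), genbank_to_figure8(end),
              span_length(start, end))
```

Under these assumptions the script reproduces the recited Figure 8 coordinates and the recited lengths of 32 and 1273 nucleotides.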

The terms "age-related regulatory activity" and "age-related activity" when made in
reference to a nucleic acid sequence refer to the ability of the nucleic acid sequence to alter
in an age-related manner (e.g., increase over a period of time) the level of transcription into
mRNA and/or the synthesis of a polypeptide encoded by a nucleotide sequence of interest
which is operably linked to a promoter sequence as compared to the level of transcription
into mRNA of the nucleotide sequence of interest which is operably linked to the promoter
sequence in the absence of the nucleic acid sequence which has age-related regulatory
activity. An "age regulatory sequence" is herein used to refer to a nucleic acid sequence
which has age-related regulatory activity.

To illustrate, where expression levels of a gene of interest decrease over a period of
time, a nucleic acid sequence is said to have age-related regulatory activity if (when operably
linked to the gene of interest) it results in (a) a smaller decrease in expression levels of the
gene over the same period of time as compared to the decrease in expression levels in the
absence of the nucleic acid sequence, (b) relatively constant (i.e., unchanged) expression
levels over the same period of time, or (c) increased expression levels over the same period
of time.
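
The three outcomes (a), (b) and (c) of the foregoing illustration may be expressed as a simple decision rule. The following sketch is offered only as an aid to understanding: the 10% tolerance used to decide that expression is "relatively constant", the comparison of the first and last time points only, and the function names are all assumptions made for the example and are not taken from the specification.

```python
def relative_change(levels):
    """Fractional change between the first and last measurement."""
    first, last = levels[0], levels[-1]
    return (last - first) / first


def classify_outcome(control, test, tolerance=0.10):
    """Return 'a', 'b' or 'c' per the illustration, or None if none applies.

    control: expression levels over time WITHOUT the candidate sequence.
    test:    expression levels over the same time points WITH it.
    tolerance: assumed threshold for "relatively constant" (hypothetical).
    """
    c = relative_change(control)
    t = relative_change(test)
    if c >= -tolerance:
        return None  # premise not met: control expression does not decline
    if t > tolerance:
        return "c"   # increased expression over the period
    if abs(t) <= tolerance:
        return "b"   # relatively constant expression
    if t > c:
        return "a"   # smaller decrease than in the control
    return None


if __name__ == "__main__":
    declining_control = [100, 60, 20]  # hypothetical values
    print(classify_outcome(declining_control, [100, 90, 70]))    # a
    print(classify_outcome(declining_control, [100, 98, 101]))   # b
    print(classify_outcome(declining_control, [100, 150, 200]))  # c
```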

The terms "operably linked," "in operable combination," and "in operable order" as
used herein refer to the linkage of nucleic acid sequences such that they perform their
intended function. For example, operably linking a promoter sequence to a nucleotide
sequence of interest refers to linking the promoter sequence and the nucleotide sequence of
interest in a manner such that the promoter sequence is capable of directing the transcription
of the nucleotide sequence of interest and/or the synthesis of a polypeptide encoded by the
nucleotide sequence of interest. Similarly, operably linking a nucleic acid sequence having
age-related regulatory activity to a promoter sequence and to a nucleotide sequence of
interest means linking the nucleic acid sequence having age-related regulatory activity, the
promoter sequence and the nucleotide sequence of interest in a manner such that the nucleic
acid sequence having age-related regulatory activity is capable of altering over a period of
time the level of transcription into mRNA of the nucleotide sequence of interest and/or the
synthesis of a polypeptide encoded by the nucleotide sequence of interest.

Methods for determining age-related regulatory activity of a candidate nucleic acid
sequence, given the teachings of the present specification, are within the ordinary skill in the
art and are exemplified by the methods disclosed herein. For example, a test vector is
constructed in which the candidate nucleic acid sequence is linked upstream or downstream
of a promoter sequence which is operably linked to a nucleotide sequence of interest (e.g.,
Example 1). A control vector which is similar to the test vector but which lacks the
candidate nucleic acid sequence is also constructed. The test vector and control vector are
separately introduced into a host cell. It is preferred that the host cell (e.g., fertilized egg)
be capable of generating a transgenic multicellular organism, e.g., a transgenic mouse (e.g.,
Example 3) and that transgenic multicellular organisms are generated. Longitudinal analyses
of the expression of mRNA which is encoded by the nucleotide sequence of interest (e.g., by
Northern blot hybridization) over a period of time in, and preferably over the entire life span
of, founders and successive generations of the transgenic multicellular organism are carried
out (e.g., Example 3). The detection in any tissue of mRNA and/or protein levels which are
encoded by the nucleotide sequence of interest and which are greater in transgenic animals
harboring the test vector as compared to the mRNA and/or protein levels in transgenic
animals harboring the control vector at at least one point in time indicates that the candidate
nucleic acid sequence has age-related regulatory activity.
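
The comparison that closes the foregoing assay may be sketched as follows. The sketch assumes that expression levels have already been quantified at matching ages for animals harboring the test vector and the control vector; the data layout (age in months mapped to a level), the sample values and the function names are hypothetical and are given only to make the criterion concrete.

```python
def ages_with_higher_expression(test, control):
    """Ages (shared by both series) at which test-vector expression
    exceeds control-vector expression."""
    shared_ages = sorted(set(test) & set(control))
    return [age for age in shared_ages if test[age] > control[age]]


def indicates_age_related_activity(test, control):
    """Criterion from the text: greater expression with the test vector
    at at least one point in time."""
    return len(ages_with_higher_expression(test, control)) > 0


if __name__ == "__main__":
    # Hypothetical mRNA levels (arbitrary units) keyed by age in months.
    test_vector = {1: 100, 6: 95, 12: 90}
    control_vector = {1: 100, 6: 60, 12: 20}
    print(ages_with_higher_expression(test_vector, control_vector))
    print(indicates_age_related_activity(test_vector, control_vector))
```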

For example, evidence provided herein shows the surprising result that AE5' (SEQ
ID NO:1) alone has age-related regulatory activity in that AE5' stabilizes hFIX mRNA
whereby hFIX mRNA levels are essentially unchanged at different time points over the
entire life span of transgenic animals (Figure 4, A, C and E) as compared to the declining
hFIX mRNA levels in transgenic animals which harbor vectors that lack AE5' (Figures 2A
and 2E). The age-related regulatory activity of AE5' was observed regardless of whether
AE5' was placed upstream (Figure 4A) or downstream (Figure 4E) of the promoter sequence
in the expression construct.

Furthermore, data provided herein demonstrate the unexpected result that AE3' (SEQ
ID NO:3) alone has age-related regulatory activity in that AE3' increases hFIX mRNA at
several time points during the life of transgenic animals (Figure 2B) relative to the hFIX
mRNA levels at the same time points in transgenic animals harboring vectors that lack AE3'
(Figure 2A). AE3' substantially increased the steady state hFIX mRNA levels (Figure 5).
This result which was observed in vivo was surprising in part because AE3' exhibited weak
down regulatory effects on hFIX production in vitro. Without limiting the invention to any
particular mechanism, these results suggest that AE3' functions by increasing hFIX mRNA
stability which directly correlates with an increase in the hFIX protein level in the
circulation. Also without intending to limit the invention to any particular theory, it is the
inventors' consideration that the age-related regulatory activity of AE3' is due to the sl
structure-forming dinucleotide repeats present in the 3' UTR; the sl region is the 103 bp
sequence (SEQ ID NO:61) from nt 32,141 through nt 32,243 of Figure 8. This
consideration is based on the inventors' observation that dinucleotide repeats, such as (AT)n
of the 3' UTR of various genes, can form sl structures in mRNA, which upon binding
specific proteins are known to modulate mRNA stability, mostly to a less stable state [Ross
(1995) Microbiol. Rev. 59:423-450].

Importantly, the invention demonstrates the surprising synergistic action of AE5' and
AE3' which together not only result in hFIX mRNA levels which are greater at each time
point tested over the life span of transgenic animals (Figure 4, B and D) as compared to
hFIX mRNA levels in transgenic animals harboring vectors that lack both AE5' and AE3',
but also yield a profile of increased human FIX mRNA levels over the life span of the
transgenic mouse which recapitulates the profile of increased mouse FIX mRNA levels as a
wild-type mouse ages.

Data presented herein further demonstrate that the age-related regulatory activity of
AE5' alone, of AE3' alone, and of the combination of AE5' and AE3' is independent of the
level of expression of the transgenes harboring them, sex, generation or zygosity status of the
transgenic animals.

The present invention is not limited to SEQ ID NOs:1 and 3 but specifically
contemplates portions thereof. As used herein the term "portion" when made in reference to
a nucleic acid sequence refers to a fragment of that sequence. The fragment may range in
size from five (5) contiguous nucleotide residues to the entire nucleic acid sequence minus
one nucleic acid residue. Thus, a nucleic acid sequence comprising "at least a portion of" a
nucleotide sequence comprises from five (5) contiguous nucleotide residues of the nucleotide
sequence to the entire nucleotide sequence.
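
The size limits in the foregoing definition of "portion" may be illustrated with a short check. The sketch below is illustrative only; the function names are hypothetical, and the count it produces is a count of start/length combinations, so that fragments which happen to share the same sequence are counted separately.

```python
SEQ_ID_NO_1 = "agccattcagtcgaggaaggatagggtggtat"  # AE5', 32 nt, as recited in the text
MIN_PORTION_LENGTH = 5


def is_portion(candidate: str, parent: str) -> bool:
    """True if candidate is a contiguous fragment of parent that is at least
    five residues long and at most the parent length minus one."""
    candidate, parent = candidate.lower(), parent.lower()
    if not (MIN_PORTION_LENGTH <= len(candidate) <= len(parent) - 1):
        return False
    return candidate in parent


def count_portions(parent: str) -> int:
    """Number of (start, length) fragments that satisfy the definition."""
    n = len(parent)
    return sum(n - length + 1 for length in range(MIN_PORTION_LENGTH, n))


if __name__ == "__main__":
    print(is_portion("gaggaag", SEQ_ID_NO_1))    # 7-nt fragment: a portion
    print(is_portion("gagg", SEQ_ID_NO_1))       # only 4 nt: too short
    print(is_portion(SEQ_ID_NO_1, SEQ_ID_NO_1))  # full length: not a "portion"
    print(count_portions(SEQ_ID_NO_1))
```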

In a preferred embodiment, portions of SEQ ID NO:1 contemplated to be within the
scope of the invention include, but are not limited to, the 7-nucleotide nucleic acid sequence
of the polyomavirus enhancer activator 3 (PEA-3) (5'-GAGGAAG-3') (SEQ ID NO:2) which
corresponds to the sequence from 2176 to 2182 of GenBank accession number K02402, and
which corresponds to the sequence from -790 to -784 of GenBank accession number K02402
when in relation to the hFIX start codon (ATG) in which the adenine is designated as
position +30. A nucleotide sequence [5'-CAGGAAG-3' (SEQ ID NO:40)] which is
homologous to the invention's PEA-3 nucleotide sequence was initially reported in the art as
a polyoma virus enhancer, and was reported to be involved in the regulation of expression of
various genes (e.g., collagen gene and c-fos) in several tissues [Martin et al. (1988) Proc.
Natl. Acad. Sci. 85:5839-5843; Xin et al. (1992) Genes & Develop. 6:481-496;
Chotteau-Lelievre et al. (1997) Oncogene 15:937-952; Gutman and Wasylyk (1990) EMBO J.
9:2241-2246]. However, the PEA-3 protein sequence [or PEA-3 related protein(s)] which
binds to nucleotide sequences which are homologous to the invention's PEA-3 nucleotide
sequence has not been reported to be either liver-specific or enriched in the liver.

Other portions of SEQ ID NO:1 included within the scope of the invention include,
for example, SEQ ID NO:33 [5'-tcgaggaagga-3'], SEQ ID NO:34 [5'-agtcgaggaaggata-3'],
SEQ ID NO:35 [5'-tcagtcgaggaaggatagg-3'], SEQ ID NO:36
[5'-attcagtcgaggaaggatagggt-3'], SEQ ID NO:37 [5'-ccattcagtcgaggaaggatagggtgg-3'], and
SEQ ID NO:38 [5'-gccattcagtcgaggaaggatagggtggta-3'], all of which include the PEA-3
nucleotide sequence.
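
The relationship between these portions, the PEA-3 nucleotide sequence and SEQ ID NO:1 may be verified mechanically. The following sketch is illustrative only: it locates each recited portion within SEQ ID NO:1 and reports its span in GenBank K02402 numbering, taking nt 2164 as the first residue of SEQ ID NO:1 as recited above. The helper name is hypothetical.

```python
SEQ_ID_NO_1 = "agccattcagtcgaggaaggatagggtggtat"  # nt 2164-2195 of K02402
PEA3 = "gaggaag"                                  # SEQ ID NO:2
GENBANK_START = 2164

# Portions transcribed from the text, keyed by SEQ ID NO.
PORTIONS = {
    33: "tcgaggaagga",
    34: "agtcgaggaaggata",
    35: "tcagtcgaggaaggatagg",
    36: "attcagtcgaggaaggatagggt",
    37: "ccattcagtcgaggaaggatagggtgg",
    38: "gccattcagtcgaggaaggatagggtggta",
}


def locate(fragment, parent=SEQ_ID_NO_1, parent_start=GENBANK_START):
    """Return the inclusive (start, end) span of fragment in GenBank
    numbering, or None if fragment does not occur in parent."""
    index = parent.find(fragment.lower())
    if index < 0:
        return None
    start = parent_start + index
    return start, start + len(fragment) - 1


if __name__ == "__main__":
    print("PEA-3", locate(PEA3))
    for seq_id, fragment in PORTIONS.items():
        print(seq_id, locate(fragment), PEA3 in fragment)
```

Run as written, the script places the PEA-3 sequence at nt 2176 to 2182, in agreement with the coordinates recited for SEQ ID NO:2.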

In a preferred embodiment, portions of SEQ ID NO:3 contemplated to be within the
scope of the invention include, but are not limited to, SEQ ID NO:51 [5'-
TTATTTTATATATATAATATATATATAAAATA-3'], SEQ ID NO:52 [5'-TATAATATA-3'],
SEQ ID NO:53 [5'-CAATATAAATATATAG-3'], SEQ ID NO:54 [5'-
TGTGTGTGTATGCGTGTGTGTAGACACACACGCATACACACATA-3'], the
combination of SEQ ID NOs:51 and 52, i.e., SEQ ID NO:55 [5'-TTATTTTATA
TATATAATATATATATAAAATATATAATATA-3'], the combination of SEQ ID NOs:52
and 53, i.e., SEQ ID NO:56 [5'-TATAATATACAATATAAATATATAG-3'], the
combination of SEQ ID NOs:53 and 54, i.e., SEQ ID NO:57 [5'-CAATATAAAT
ATATAGTGTGTGTGTATGCGTGTGTGTAGACACACACGCATACACACATA-3'], the
combination of SEQ ID NOs:51, 52, 53, and 54, i.e., SEQ ID NO:58 [5'-TTAT
TTTATATATATAATATATATATAAAATATATAATATACAATATAAATATATAGTGT
GTGTGTATGCGTGTGTGTAGACACACACGCATACACACATA-3'], the 723 bp sequence
(SEQ ID NO:59) from nt 31,418 through nt 32,140 of Figure 8, the 447 bp sequence (SEQ
ID NO:60) from nt 32,244 through nt 32,690 of Figure 8, and the 103 bp sequence (SEQ ID
NO:61) (i.e., the sl region of the 3' UTR) from nt 32,141 through nt 32,243 of Figure 8.
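
As recited, the "combination" sequences read as head-to-tail joins of their component sequences. The sketch below, which is illustrative only, checks this for SEQ ID NOs:55 and 56 using the sequences as transcribed from the text with spaces and line breaks removed; the helper name `combine` is hypothetical.

```python
# Component sequences as transcribed from the text.
SEQ_51 = "TTATTTTATATATATAATATATATATAAAATA"
SEQ_52 = "TATAATATA"
SEQ_53 = "CAATATAAATATATAG"

# Recited combination sequences, kept in the two pieces in which the
# text prints SEQ ID NO:55 (adjacent string literals are concatenated).
SEQ_55 = "TTATTTTATA" "TATATAATATATATATAAAATATATAATATA"
SEQ_56 = "TATAATATACAATATAAATATATAG"


def combine(*parts: str) -> str:
    """Join component sequences head to tail, 5' to 3'."""
    return "".join(parts)


if __name__ == "__main__":
    print(combine(SEQ_51, SEQ_52) == SEQ_55)
    print(combine(SEQ_52, SEQ_53) == SEQ_56)
    print(len(SEQ_51), len(SEQ_52), len(SEQ_53), len(SEQ_55), len(SEQ_56))
```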

The nucleotide sequence of portions of SEQ ID NOs:1 and 3 which exhibit
age-related regulatory activity may be determined using methods known in the art, e.g., using
deletion constructs (e.g., see Yang et al. (1998) J. Biol. Chem. 273:891-897). Briefly,
several expression plasmids are constructed to contain a reporter gene under the control of a
promoter and of different candidate nucleotide sequences which are obtained either by
restriction enzyme deletion of internal sequences in SEQ ID NOs:1 and 3, restriction enzyme
truncation of sequences at the 5' and/or 3' end of SEQ ID NOs:1 and 3, by the introduction
of single nucleic acid base changes by PCR into SEQ ID NOs:1 and 3, or by chemical
synthesis. The age-related regulatory activity of the different constructs is determined as
described supra in order to determine whether the candidate nucleotide sequence exhibits
age-related regulatory activity.
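
The candidate sequences contemplated by the foregoing deletion-construct approach may be enumerated in silico before any construct is made. The sketch below is illustrative only and models just two of the recited routes (terminal truncation and single base changes); it does not model restriction sites, and the function names and the five-residue lower bound on truncation (borrowed from the definition of "portion") are assumptions.

```python
SEQ_ID_NO_1 = "agccattcagtcgaggaaggatagggtggtat"  # AE5', as recited in the text


def five_prime_truncations(seq, min_len=5):
    """Successively remove residues from the 5' end, keeping >= min_len."""
    return [seq[i:] for i in range(1, len(seq) - min_len + 1)]


def three_prime_truncations(seq, min_len=5):
    """Successively remove residues from the 3' end, keeping >= min_len."""
    return [seq[:j] for j in range(len(seq) - 1, min_len - 1, -1)]


def single_base_changes(seq, alphabet="acgt"):
    """Every sequence differing from seq by exactly one substitution."""
    variants = []
    for i, base in enumerate(seq):
        for alt in alphabet:
            if alt != base:
                variants.append(seq[:i] + alt + seq[i + 1:])
    return variants


if __name__ == "__main__":
    print(len(five_prime_truncations(SEQ_ID_NO_1)))
    print(len(three_prime_truncations(SEQ_ID_NO_1)))
    print(len(single_base_changes(SEQ_ID_NO_1)))
```

Each candidate so enumerated would still have to be tested in the longitudinal assay described supra; the enumeration itself says nothing about activity.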

The sequences of the present invention are not limited to SEQ ID NOs:1 and 3 and
portions thereof, but also include homologs of SEQ ID NOs:1 and 3, and homologs of
portions thereof. Homologs of SEQ ID NOs:1 and 3, and of portions thereof, include, but
are not limited to, nucleotide sequences having deletions, insertions, or substitutions of
different nucleotides or nucleotide analogs as compared to SEQ ID NOs:1 and 3, and of
portions thereof, respectively. Such homologs may be produced using methods well known
in the art.

A "homolog" of SEQ ID NO:1 is defined as a nucleotide sequence having more than
63% identity and less than 100% identity with SEQ ID NO:1. Homologs of SEQ ID NO:1
are exemplified by, but not limited to, SEQ ID NO:66 (5'-acccatt cagtcgagga aggatagggt
ggtat-3') which is the sequence from nt 2,164 to nt 2,195 of GenBank accession number
K02402, except that the G at nt 2,165 is replaced with a C; SEQ ID NO:67 (5'-agccatt
gagtcgagga aggatagggt ggtat-3') which is the sequence from nt 2,164 to nt 2,195 of GenBank
accession number K02402, except that the C at nt 2,171 is replaced with a G; SEQ ID NO:68
(5'-agccatt cagacgagga aggatagggt ggtat-3') which is the sequence from nt 2,164 to nt 2,195
of GenBank accession number K02402, except that the T at nt 2,174 is replaced with an A;
SEQ ID NO:69 (5'-agccatt cagtcgagga aggatagggt ggttt-3') which is the sequence from nt
2,164 to nt 2,195 of GenBank accession number K02402, except that the A at nt 2,194 is
replaced with a T; SEQ ID NO:70 (5'-agccatt cagtcgagga tcccaagggt ggtat-3') which is the
sequence from nt 2,164 to nt 2,195 of GenBank accession number K02402, except that
AGGGT beginning at nt 2,186 is replaced with TCCCA; SEQ ID NO:71 (5'-agccatt
cagtcgagga aggatagggcctaat-3') which is the sequence from nt 2,164 to nt 2,195 of GenBank
accession number K02402, except that TGGT beginning at nt 2,190 is replaced with CCTA;
SEQ ID NO:72 (5'-agaccatt cagtcgagga aggatagggt ggtat-3') which is the sequence from nt
2,164 to nt 2,195 of GenBank accession number K02402, except that an A is inserted after
nt 2,165; SEQ ID NO:73 (5'-agccatt cagtcgagga aggatagcggt ggtat-3') which is the sequence
from nt 2,164 to nt 2,195 of GenBank accession number K02402, except that a C is inserted
after nt 2,187; SEQ ID NO:74 (5'-agccatt cagtcgagga aggataat-3') which is the sequence from
nt 2,164 to nt 2,195 of GenBank accession number K02402, except that GGGTGGT
beginning at nt 2,187 is deleted; and SEQ ID NO:75 (5'-agccatt cgagga aggatagggt ggtat-3')
which is the sequence from nt 2,164 to nt 2,195 of GenBank accession number K02402,
except that CAGT beginning at nt 2,171 is deleted.
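
Because the definition of "homolog" turns on percent identity, a worked calculation may be helpful. The present passage does not state how identity is to be computed, so the sketch below rests on assumptions chosen for the example: a global alignment with unit scores (match +1, mismatch -1, gap -1), and identity taken as matching columns divided by total alignment columns. Other alignment methods or denominators may give different values. The function names are hypothetical.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity over a global alignment (assumed scoring:
    match +1, mismatch -1, gap -1; identity = matches / alignment columns)."""
    a, b = a.lower(), b.lower()
    n, m = len(a), len(b)
    # score[i][j] is the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = -i
    for j in range(1, m + 1):
        score[0][j] = -j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = 1 if a[i - 1] == b[j - 1] else -1
            score[i][j] = max(score[i - 1][j - 1] + step,
                              score[i - 1][j] - 1,
                              score[i][j - 1] - 1)
    # Trace back one optimal alignment, counting matches and columns.
    i, j, matches, columns = n, m, 0, 0
    while i > 0 or j > 0:
        columns += 1
        if i > 0 and j > 0:
            step = 1 if a[i - 1] == b[j - 1] else -1
            if score[i][j] == score[i - 1][j - 1] + step:
                if step == 1:
                    matches += 1
                i, j = i - 1, j - 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] - 1:
            i -= 1
        else:
            j -= 1
    return 100.0 * matches / columns if columns else 0.0


def is_homolog(candidate: str, reference: str, lower_bound: float) -> bool:
    """More than lower_bound percent and less than 100 percent identity."""
    return lower_bound < percent_identity(candidate, reference) < 100.0


if __name__ == "__main__":
    seq_1 = "agccattcagtcgaggaaggatagggtggtat"
    seq_66 = "acccattcagtcgaggaaggatagggtggtat"  # one substitution
    seq_75 = "agccattcgaggaaggatagggtggtat"      # CAGT deleted
    for name, seq in (("SEQ ID NO:66", seq_66), ("SEQ ID NO:75", seq_75)):
        print(name, percent_identity(seq_1, seq), is_homolog(seq, seq_1, 63.0))
```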

A "homolog" of SEQ ID NO:2 is defined as a nucleotide sequence having more than
75% identity and less than 100% identity with SEQ ID NO:2. Homologs of SEQ ID NO:2
include, for example, GAGGATG (SEQ ID NO:39), CAGGAAG (SEQ ID NO:40),
CAGGATG (SEQ ID NO:41), GTGGAAG (SEQ ID NO:62), GTGGATG (SEQ ID NO:63),
CTGGAAG (SEQ ID NO:64), CTGGATG (SEQ ID NO:65), and CAGGAAG (SEQ ID
NO:84).

A "homolog" of SEQ ID NO:3 is defined as a nucleotide sequence having less than
100% and more than 60% identity with SEQ ID NO:3. Homologs of SEQ ID NO:3 are
exemplified by, but not limited to, SEQ ID NOs:76-83 shown in Figure 13. Specifically, SEQ
ID NO:76 is the sequence from nt 34,383 to nt 35,655 of GenBank accession number
K02402, except that the C at nt 34,390 has been replaced with a G. SEQ ID NO:77 is the
sequence from nt 34,383 to nt 35,655 of GenBank accession number K02402, except that the
T at nt 34,649 has been replaced with an A. SEQ ID NO:78 is the sequence from nt 34,383
to nt 35,655 of GenBank accession number K02402, except that the GC beginning at nt
34,959 has been replaced with a CG. SEQ ID NO:79 is the sequence from nt 34,383 to nt
35,655 of GenBank accession number K02402, except that the CATG beginning at nt 35,501
has been replaced with a GTAC. SEQ ID NO:80 is the sequence from nt 34,383 to nt
35,655 of GenBank accession number K02402, except that TT is inserted after the A at nt
34,681. SEQ ID NO:81 is the sequence from nt 34,383 to nt 35,655 of GenBank accession
number K02402, except that TGC is inserted after the C at nt 35,581. SEQ ID NO:82 is the
sequence from nt 34,383 to nt 35,655 of GenBank accession number K02402, except that A
at nt 35,636 is deleted. SEQ ID NO:83 is the sequence from nt 34,383 to nt 35,655 of
GenBank accession number K02402, except that the G at nt 34,383 is deleted.

A "homolog" of SEQ ID NO:59 is defined as a nucleotide sequence having less than
100% and more than 62% identity with SEQ ID NO:59.

A "homolog" of SEQ ID NO:60 is defined as a nucleotide sequence having less than
100% and more than 60% identity with SEQ ID NO:60.

A "homolog" of SEQ ID NO:61 is defined as a nucleotide sequence having less than
100% and more than 60% identity with SEQ ID NO:61.

Homologs of a portion of SEQ ID NO:1 are exemplified by homologs of the PEA-3
nucleotide sequence (SEQ ID NO:2), which include, for example, GAGGATG (SEQ ID
NO:39), CAGGAAG (SEQ ID NO:40), CAGGATG (SEQ ID NO:41), GTGGAAG (SEQ ID
NO:62), GTGGATG (SEQ ID NO:63), CTGGAAG (SEQ ID NO:64), CTGGATG (SEQ ID
NO:65), and CAGGAAG (SEQ ID NO:84).



WO 00/75279 PCT/US00/15728

The present invention also contemplates functioning or functional homologs of SEQ 
ID NO:l, of portions of SEQ ID NO:l (e.g., functional portions of SEQ ID NOs:2, and 33- 
38), of SEQ ID NO:3, and of portions of SEQ ID NO:3 (e.g., functional portions of SEQ ID 
NOs:51-61). 

A "functional homolog" of SEQ ID NO:1 is defined as a nucleotide sequence having
more than 63% identity and less than 100% identity with SEQ ID NO:1, and which has
age-related regulatory activity. Alternatively, a functional homolog of SEQ ID NO:1 is a
nucleotide sequence having more than 63% identity and less than 100% identity with SEQ
ID NO:1, and having liver-specific activity.

A "functional homolog" of SEQ ID NO:2 is defined as a nucleotide sequence having
more than 75% identity and less than 100% identity with SEQ ID NO:2, and which has
age-related regulatory activity. Alternatively, a functional homolog of SEQ ID NO:2 is a
nucleotide sequence having more than 75% identity and less than 100% identity with SEQ
ID NO:2, and having liver-specific activity.

A "functional homolog" of SEQ ID NO:3 is defined as a nucleotide sequence having
less than 100% and more than 60% identity with SEQ ID NO:3, and which has age-related
regulatory activity.

A "functional homolog" of SEQ ID NO:59 is defined as a nucleotide sequence having
less than 100% and more than 62% identity with SEQ ID NO:59, and which has age-related
regulatory activity.

A "functional homolog" of SEQ ID NO:60 is defined as a nucleotide sequence having
less than 100% and more than 60% identity with SEQ ID NO:60, and which has age-related
regulatory activity.

A "functional homolog" of SEQ ID NO:61 is defined as a nucleotide sequence having
less than 100% and more than 60% identity with SEQ ID NO:61, and which has age-related
regulatory activity.

The present invention is not limited to sense molecules of SEQ ID NOs:1 and 3 but
contemplates within its scope antisense molecules comprising a nucleic acid sequence
complementary to at least a portion (e.g., a portion greater than 10 nucleotide bases in length
and more preferably greater than 100 nucleotide bases in length) of the nucleotide sequence
of SEQ ID NOs:1 and 3. These antisense molecules find use in, for example, reducing or
preventing expression of a gene (e.g., hFIX) whose expression is regulated by SEQ ID NOs:1
and 3.
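
A sequence complementary to a portion of SEQ ID NO:1 may be derived by taking the reverse complement of that portion. The sketch below is illustrative only; it assumes standard Watson-Crick pairing of unmodified DNA bases and writes the antisense sequence 5' to 3', and the function name is hypothetical.

```python
# Watson-Crick complements for unmodified DNA bases (both cases).
_COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq: str) -> str:
    """Antisense (reverse-complement) sequence, written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    seq_id_no_1 = "agccattcagtcgaggaaggatagggtggtat"  # AE5', as recited in the text
    antisense = reverse_complement(seq_id_no_1)
    print(antisense)
    print(len(antisense) > 10)  # meets the "greater than 10 bases" example
```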

The invention further provides the nucleotide sequence AE3'' which is a preferred
portion of AE3'. AE3'' [5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagccat-3'; (SEQ ID NO:93)] is the 154-nucleotide nucleic acid sequence from
35,075 to 35,228 of GenBank accession number K02402, which corresponds to the sequence
from 32,110 to 32,263 of Figure 8 when in relation to the hFIX start codon (ATG) in which
the adenine is designated as position +30. AE3'' contains a 102-bp stem-loop forming
sequence (SEQ ID NO:91).

Data presented herein demonstrate the universality of the regulatory function of
portions of AE3' in that AE3'' has been successfully used to regulate expression of the
heterologous hPC gene in an age-related manner. In particular, transgenic animals
containing the -1462hPCml construct expressed age-stable levels of human protein C, i.e.,
expressed relatively constant levels of human protein C at different time points during the
life span of the transgenic animals (Example 12, Figure 17A). In direct contrast, the
presence of AE5' and AE3'' sequences resulted in increased expression levels of human
protein C over time (Figure 17C). These results confirm the universality of the function of
the AE3'' portion of AE3' in regulating expression of operably linked genes in an
age-related manner.

The invention also contemplates portions of AE3''. These portions include, but are
not limited to, SEQ ID NOs:94-144 wherein SEQ ID NO:94 is 5'-tgggg gaaaagtttc
tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg
tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:95 is 5'-gggg
gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:96 is 5'-ggg
gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:97 is 5'-gg
gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:98 is 5'-g
gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:99 is 5'-
gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:100 is
5*-aaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg 
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 101 is 
S'-aaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg 
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3*;. SEQ ID NO: 102 is 
S'-aagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg 
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 103 is 
5'*agtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg 
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 104 is 
5'-gtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg 
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 105 is 5'-tttc 
tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg 
tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 106 is 5'-ttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca 
cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:107 is 5'-tc tttcagagag ttaagttatt 
ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata 
cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 108 is 5'-c tttcagagag ttaagttatt ttatatatat 
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaagcaa taagccat-3'; SEQ ID NO:109 is 5'-tttcagagag ttaagttatt ttatatatat aatatatata 
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaagcaa taagccat-3'; SEQ ID NO: 110 is 5'-ttcagagag ttaagttatt ttatatatat aatatatata 
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaagcaa taagccat-3'; SEQ ID NO:111 is 5'-tcagagag ttaagttatt ttatatatat aatatatata taaaatatat
aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccat-3'; SEQ ID NO:112 is 5'-cagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat 
ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ 
ID NO:113 is 5'-agagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:114 is
5'-gagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg 
tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO: 115 is 5'-agag ttaagttatt 
ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata
cacacatata atggaagcaa taagccat-3'; SEQ ID NO:116 is 5'-gag ttaagttatt ttatatatat aatatatata
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagccat-3'; SEQ ID NO:117 is 5'-ag ttaagttatt ttatatatat aatatatata taaaatatat
aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa
taagccat-3'; SEQ ID NO:118 is 5'-g ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat
ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ
ID NO:119 is 5'-ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:120 is
5'-taagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca
cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:121 is 5'-aagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagccat-3'; SEQ ID NO:122 is 5'-agttatt ttatatatat aatatatata taaaatatat aatatacaat
ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ
ID NO:123 is 5'-gttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg
tgtgtagaca cacacgcata cacacatata atggaagcaa taagccat-3'; SEQ ID NO:124 is 5'-ttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagccat-3'; SEQ ID NO:125 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagcca-3'; SEQ ID NO:126 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagcc-3'; SEQ ID NO:127 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagc-3'; SEQ ID NO:128 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taag-3'; SEQ ID NO:129 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taa-3'; SEQ ID NO:130 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa ta-3'; SEQ ID NO:131 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa t-3'; SEQ ID NO:132 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa-3'; SEQ ID NO:133 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat
aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaagca-3'; SEQ ID NO: 134 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata 
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaagc-3'; SEQ ID NO:135 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaag-3'; SEQ ID NO:136 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 
atggaa-3'; SEQ ID NO:137 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata
taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atgga-3';
SEQ ID NO: 138 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat 
aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atgg-3'; SEQ ID 
NO: 139 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat 
ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atg-3'; SEQ ID NO:140 is
5'-ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata at-3'; SEQ ID NO: 141 is 5'-ttgggg 
gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg 
tgtatgcgtg tgtgtagaca cacacgcata cacacatata a-3'; SEQ ID NO:142 is 5'-ttgggg gaaaagtttc
tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg
tgtgtagaca cacacgcata cacacatata-3'; SEQ ID NO:143 is 5'-ttgggg gaaaagtttc tttcagagag
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca
cacacgcata cacacatat-3'; and SEQ ID NO:144 is 5'-ttgggg gaaaagtttc tttcagagag ttaagttatt
ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata
cacacata-3'.
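
By way of illustration only, the sequences listed above appear to form nested series in which each member lacks one more nucleotide at the 5' end or at the 3' end than the preceding member (for example, SEQ ID NOs:125 to 144 lack from 1 to 20 nucleotides at the 3' end). The following Python sketch shows how such series could be generated. The parent sequence is assumed to be the full-length sequence from which the listed sequences are derived, and the function names are hypothetical and form no part of the disclosure.

```python
# Assumed parent sequence, copied from the listing above (whitespace is removed).
PARENT = "".join("""
ttgggg gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat
aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata
atggaagcaa taagccat
""".split())


def truncate_5prime(seq, max_removed):
    """Return the sequences lacking 1..max_removed nucleotides at the 5' end."""
    return [seq[k:] for k in range(1, max_removed + 1)]


def truncate_3prime(seq, max_removed):
    """Return the sequences lacking 1..max_removed nucleotides at the 3' end."""
    return [seq[:len(seq) - k] for k in range(1, max_removed + 1)]


# Hypothetical usage: 31 progressive 5' deletions and 20 progressive 3' deletions.
five_prime_series = truncate_5prime(PARENT, 31)
three_prime_series = truncate_3prime(PARENT, 20)
```

Each list element can then be compared with the corresponding entry of the sequence listing to check a transcription.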

In a particularly preferred embodiment, the portion of AE3' and of AE3" is the
102-nucleotide stem-loop forming nucleic acid sequence 5'-tatt ttatatatat aatatatata taaaatatat
aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacata-3' (SEQ ID NO:91).
The stem-loop sequence is located from nt 32,142 to 32,243 of Figure 8, which corresponds
to the sequence from nt 35,107 to 35,208 of GenBank accession number K02402. It is the
inventors' belief that the 102-nt stem-loop forming sequence exhibits the age-regulatory
functions, described herein, of the AE3' and AE3" sequences.
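
As a simple consistency check, offered as a sketch only, the length of the stem-loop forming sequence and the two sets of coordinates given above can be compared in Python. The variable and function names are hypothetical; the coordinates are taken to be inclusive and 1-based, which is an assumption.

```python
# SEQ ID NO:91 as printed above (whitespace is removed).
STEM_LOOP = "".join("""
tatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat agtgtgtgtg
tgtatgcgtg tgtgtagaca cacacgcata cacacata
""".split())


def span_length(start, end):
    """Length of an inclusive, 1-based nucleotide interval (assumed convention)."""
    return end - start + 1


FIG8_SPAN = (32142, 32243)      # coordinates in Figure 8
GENBANK_SPAN = (35107, 35208)   # coordinates in the GenBank record

# Offset between the two numbering systems, computed at both ends of the span.
offset_start = GENBANK_SPAN[0] - FIG8_SPAN[0]
offset_end = GENBANK_SPAN[1] - FIG8_SPAN[1]
```

Under these assumptions both intervals span 102 nucleotides, matching the length of the printed sequence, and the two numbering systems differ by a constant offset.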

The nucleotide sequences of SEQ ID NOs:1 and 3, and portions, homologs and antisense
sequences thereof, may be synthesized by synthetic chemistry techniques which are
commercially available and well known in the art [see Caruthers MH et al. (1980) Nuc.
Acids Res. Symp. Ser. 215-223; Horn T et al. (1980) Nuc. Acids Res. Symp. Ser. 225-232].
Additionally, fragments of SEQ ID NOs:1 and 3 can be made by treatment of SEQ ID NOs:1
and 3 with restriction enzymes followed by purification of the fragments by gel
electrophoresis. Alternatively, sequences may also be produced using the polymerase chain
reaction (PCR) as described by Mullis [U.S. Patent Nos. 4,683,195, 4,683,202 and
4,965,188, all of which are hereby incorporated by reference]. SEQ ID NOs:1 and 3, and
portions, homologs and antisense sequences thereof, may be ligated to each other or to
heterologous nucleic acid sequences using methods well known in the art.

The nucleotide sequence of synthesized sequences may be confirmed using
commercially available kits as well as using methods well known in the art which utilize
enzymes such as the Klenow fragment of DNA polymerase I, Sequenase®, Taq DNA
polymerase, or thermostable T7 polymerase. Capillary electrophoresis may also be used to
analyze the size and confirm the nucleotide sequence of the products of nucleic acid
synthesis, restriction enzyme digestion or PCR amplification.

It is readily appreciated by those in the art that the sequences of the present invention
may be used in a variety of ways. For example, the nucleic acid sequences of the invention
and portions thereof can be used as probes for the detection and isolation of functional
homologs of AE5' and AE3', and for amplification of homologous nucleotide sequences,
age-related and/or liver-specific expression of a nucleotide sequence of interest in an animal,
gene therapy, and reducing factor IX levels in an animal.

ii. Regulatory Nucleic Acid Sequences From The hPC Gene

The invention provides regulatory nucleic acid sequences which are derived from the
hPC gene. The presence of these sequences and their surprising properties were fortuitously
discovered by the inventors during their investigation of the universality of the function of
regulatory sequences from the hFIX gene. In particular, the inventors used what they believed
would be "control" constructs which contained different portions of the sequence upstream of
nt +1 in the hPC gene. They discovered that transgenic animals containing the -1462hPCml
construct exhibited relatively constant and relatively high levels (from about 100 to about
3000 ng/ml) of human protein C over time (Figure 17A). In dramatic contrast, transgenic
animals containing the -82hPCml construct exhibited relatively low levels (from about 5 to
about 40 ng/ml at 1 month of age) (Figure 17B) of human protein C which declined at a
precipitous rate over time (Example 12). Indeed, by the age of 5 months, human protein C
levels were undetectable in all transgenic animals harboring the -82hPCml construct. These
results demonstrated to the inventors that the nucleotide sequence from nt -1462 to nt -83
of the human protein C gene exhibits age-related regulatory activity (as evidenced by
age-stable expression over time) and regulatory activity (as evidenced by the relatively
higher levels of expression as compared to the levels in the absence of the nucleotide
sequence from nt -1462 to nt -83) of operably linked sequences of interest.

The term "regulatory activity" when made herein in reference to a nucleic acid
sequence refers to the ability of the nucleic acid sequence to alter the level of transcription
into mRNA and/or the synthesis of a polypeptide encoded by a nucleotide sequence of
interest which is operably linked to a promoter sequence, as compared to the level of
transcription into mRNA and/or synthesis of the polypeptide encoded by the nucleotide
sequence of interest which is operably linked to the promoter sequence in the absence of the
nucleic acid sequence which has regulatory activity. In contrast to "age-related regulatory
activity," the term "regulatory activity" relates to alteration in the level of transcription or
protein synthesis at a single time point (rather than over a period of time) in the life cycle of
a cell, tissue, or organism. In a preferred embodiment, the level of transcription into mRNA
and/or synthesis of polypeptide is increased from 2-fold to at least 10,000-fold, preferably
from 2-fold to 10,000-fold, more preferably from 2-fold to 1,000-fold, and most preferably
from 2-fold to 500-fold, when compared to the level of transcription into mRNA and/or
synthesis of the polypeptide encoded by the nucleotide sequence of interest which is operably
linked to the promoter sequence in the absence of the nucleic acid sequence which has
regulatory activity. Data presented herein demonstrate that the inclusion of the human
protein C (hPC) nucleotide sequence from nt -1462 to nt -83 resulted in expression of from
about 100 to about 3000 ng/ml of human protein C (Figure 17A) as compared to expression
of from about 5 to about 40 ng/ml of human protein C (Figure 17B) in transgenic mice at
one month of age. The increased level of expression was even more dramatic when the
transgenic animals were 5 months old; while the levels of hPC were undetectable in the
absence of the sequence from nt -1462 to nt -83, they were from about 100 to about 3000
ng/ml in the presence of this sequence.
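
For orientation only, the approximate expression ranges quoted above for one-month-old animals can be converted into a range of fold increases. The following Python sketch bounds the fold change by dividing the extremes of the two ranges; the function name is hypothetical, and because the quoted levels are approximate ("about"), the resulting bounds are illustrative rather than measured values.

```python
def fold_change_bounds(with_sequence, without_sequence):
    """Return (smallest, largest) fold change implied by two (low, high) ranges."""
    low_with, high_with = with_sequence
    low_without, high_without = without_sequence
    smallest = low_with / high_without   # lowest level with vs. highest without
    largest = high_with / low_without    # highest level with vs. lowest without
    return smallest, largest


# Approximate ranges in ng/ml quoted in the text for one-month-old animals.
WITH_SEQUENCE = (100.0, 3000.0)    # nt -1462 to nt -83 present (Figure 17A)
WITHOUT_SEQUENCE = (5.0, 40.0)     # nt -1462 to nt -83 absent (Figure 17B)

smallest, largest = fold_change_bounds(WITH_SEQUENCE, WITHOUT_SEQUENCE)
```

On these figures the implied increase lies between 2.5-fold and 600-fold. At 5 months of age no ratio can be computed in this way, because the levels in the absence of the sequence were undetectable.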

In one embodiment, the invention provides the nucleic acid sequence (SEQ ID 
NO:85) from -1462 to +1 of the hPC gene (Figure 14; 5'-GAATTCTGTA AGCATTTCCT 
ATGTGTACCT GCCCCTGGGC AAGGTGGGCC TGACTTGTTA GAGTGTTAGA
GTTTTACCCT GTTCCTCTAG GAGGGCCTGG TACCACCACA GCCCAGCATG 
GTGTGGTGCC TCAGCAGGAG GCATCTGGTT ACAATCAACA CAAGCTGTTC 
CAGCCAATTT AAAGAAACTT CAGGAGGAAT AGGGTTTTAG GAGGGCATGG 
GGACCCTCCT GCACCCGAAG CCAGGATGTG CCACCAATCA TAAGGAGGCA
GGGGCCTCCT TCCGCTGCTC CCTGGGACTC TCTAGGTGTC CGTGGCCTCA
GCCCCCCTCT GCACACCTGC ATCTtCCTTC TCATCAGCTT CCTCTGCTTT 
AAGCGTAAAC ATGGATGCCC AGGACCTGGC CTCAATCTTC CGAGTCTGGT 
ACTTATGGTG TACTGACAGT GTGAGACCCT ACTCCTCTGA TCAATCCCCT 
GGGTTGGTGA CTTCCCTGTG CAATCAATGG AAGCCAGCGA GGCAGGGTCA
CATGCCCCGT TTAGAGGTGC AGACTTGGAG AAGGAACGTG GGCAAGTCTT
CCCAGGAACA GGTAGGGCAG GGAGGAAAGG GGGGCATCTC TGGTGCAGCC 
CGGTTCGGAG CAGGAAGACG CTTAATAAAT GCTGATAGAC TGCAGGACAC 
AGGCAAAGGT GCTGAGCTGG ACCCTTTATT TCTGCCCTTC TCCCTTCTGG 
CACCCCGGCC AGGAAATTGC TGCAGCCTTT CTGGAATCCC GTTCATTTTT
CTTACTGGTC CACAAAAGGG GCCAAATGGA AGCAGCAAGA CCTGAGTTCA
AATTAAATCT GCCAACTACC AGCTCAGTGA ATCTGGGCGA GTAACACAAA 
ACTTGAGTGT CCTTACCTGA AAAATAGAGG TTAGAGGGAT GCTATGTGCC 
ATTGTGTGTG TGTGTTGGGG GTGGGGATTG GGGGTGATTT GTGAGCAATT 
GGAGGTGAGG GTGGAGCCCA GTGCCCAGCA CCTATGCACT GGGGACCCAA
AAAGGAGCAT CTTCTCATGA TTTTATGTAT CAGAAATTGG GATGGCATGT
CATTGGGACA GCGTCTTTTT TCTTGTATGG TGGCACATAA ATACATGTGT 
CTTATAATTA ATGGTATTTT AGATTTGACG AAATATGGAA TATTACCTGT 
TGTGCTGATC TTGGGCAAAC TATAATATCT CTGGGCAAAA ATGTCCCCAT 
CTGAAAAACA GGGACAACGT TCCTCCCTCA GCCAGCCACT ATGGGGCTAA
AATGAGACCA CATCTGTCAA GGGTTTTGCC CTCACCTCCC TCCCTGCTGG
ATGGCATCCT TGGTAGGCAG AGGTGGGCTT CGGGCAGAAC AAGCCGTGCT 
GAGCTAGGAC CAGGAGTGCT AGTGCCACTG TTTGTCTATG GAGAGGGAGG
CCTCAGTGCT GAGGGCCAAG CAAATATTTG TGGTTATGGA TTA-3'). This
sequence was successfully used to express hPC in transgenic mice in an age-related manner
(i.e., at relatively constant levels over time; Figure 17A), and at relatively high levels (e.g.,
as compared to expression levels in the absence of the sequence from nt -1462 to nt -83;
Figure 17B).

The invention also provides the nucleotide sequence (SEQ ID NO:92) from -1462 to
-83 of the hPC gene (Figure 14). The finding that SEQ ID NO:92 exhibited age-related
regulatory activity and regulatory activity as discussed in Example 11, infra, was surprising
because it was contrary to the results which had previously been reported by Miao et al.
[Miao et al. (1996) J. Biol. Chem. 16:9587-9594] when using a heterologous reporter gene,
chloramphenicol acetyltransferase (CAT), under the transcriptional control of varying lengths
of the protein C 5'-end sequences. In particular, Miao et al. found that construct pPC-1528
which contained nucleotides -1462 to +1 resulted in substantially reduced CAT in vitro
activity as compared to construct pPC-82-66 which contained nucleotides -82 to +1. From
this, Miao et al. concluded that there is an element with silencer activity in the region
between -1462 and -82. In contrast, data presented herein demonstrate that nucleotides
-1462 to +1 and -82 to +1 resulted in similar in vitro activities of hPC (Example 11, Figure
16). Indeed, the inventors' data disclosed in relation to hPC's age-related regulatory
sequences which are upstream of the hPC coding sequences when used to regulate expression
of hPC are consistent with the inventors' observation (Figure 1) in relation to hFIX's
age-related regulatory AE5' sequences when used to regulate expression of hFIX. In particular,
the inventors have observed that hFIX's AE3' and AE3" sequences resulted in moderate
suppression in in vitro transient expression assays when using hFIX and hPC, respectively
(Figures 1 and 16).

Further contemplated to be within the scope of this invention are portions of SEQ
ID NOs:85 and 92. In a particularly preferred embodiment, these portions contain one or
both of the first PEA-3 element (SEQ ID NO:89) [7-bp long; 5'-GAGGAAA-3', from -871
to -865 of the hPC gene of Figure 14] and the second PEA-3 element (SEQ ID NO:90) [7-bp
long; 5'-CAGGAAG-3', from -832 to -826 of the hPC gene of Figure 14].






Exemplary portions of SEQ ID NOs:85 and 92 which contain both the first and
second PEA-3 elements include, but are not limited to, the nucleotide sequence (SEQ ID
NO:88) from -1462 to -802 of the hPC gene, which is embodied in plasmid -849hPCml
(Figure 15, Example 10). Further examples include the nucleotide sequence (SEQ ID
NO:145) from -1462 to -804, (SEQ ID NO:146) from -1462 to -805, (SEQ ID NO:147)
from -1462 to -806, (SEQ ID NO:148) from -1462 to -807, (SEQ ID NO:149) from -1462
to -808, (SEQ ID NO:150) from -1462 to -809, (SEQ ID NO:151) from -1462 to -810,
(SEQ ID NO:152) from -1462 to -811, (SEQ ID NO:153) from -1462 to -812, (SEQ ID
NO:154) from -1462 to -813, (SEQ ID NO:155) from -1462 to -814, (SEQ ID NO:156)
from -1462 to -815, (SEQ ID NO:157) from -1462 to -816, (SEQ ID NO:158) from -1462
to -817, (SEQ ID NO:159) from -1462 to -818, (SEQ ID NO:160) from -1462 to -819,
(SEQ ID NO:161) from -1462 to -820, (SEQ ID NO:162) from -1462 to -821, (SEQ ID
NO:163) from -1462 to -822, (SEQ ID NO:164) from -1462 to -823, (SEQ ID NO:165)
from -1462 to -824, (SEQ ID NO:166) from -1462 to -825, (SEQ ID NO:167) from -1462
to -826, (SEQ ID NO:168) from -1452 to -803, (SEQ ID NO:169) from -1442 to -803,
(SEQ ID NO:170) from -1412 to -803, (SEQ ID NO:171) from -1102 to -803, (SEQ ID
NO:172) from -902 to -803, and (SEQ ID NO:173) from -873 to -803.

Portions of SEQ ID NOs:85 and 92 which contain only the first PEA-3 element
include the nucleotide sequence (SEQ ID NO:87) from -1462 to -849 of the hPC gene,
which is embodied in plasmid -802hPCml (Figure 15, Example 10). Additional exemplary
portions include, but are not limited to, the nucleotide sequence (SEQ ID NO:174) from
-1462 to -850, (SEQ ID NO:175) from -1462 to -851, (SEQ ID NO:176) from -1462 to
-852, (SEQ ID NO:177) from -1462 to -853, (SEQ ID NO:178) from -1462 to -854, (SEQ
ID NO:179) from -1462 to -855, (SEQ ID NO:180) from -1462 to -856, (SEQ ID NO:181)
from -1462 to -857, (SEQ ID NO:182) from -1462 to -858, (SEQ ID NO:183) from -1462
to -859, (SEQ ID NO:184) from -1462 to -860, (SEQ ID NO:185) from -1462 to -861,
(SEQ ID NO:186) from -1462 to -862, (SEQ ID NO:187) from -1462 to -863, (SEQ ID
NO:188) from -1462 to -864, (SEQ ID NO:189) from -1462 to -865, (SEQ ID NO:190)
from -1362 to -865, (SEQ ID NO:191) from -1262 to -865, (SEQ ID NO:192) from -1162
to -865, (SEQ ID NO:193) from -1062 to -865, (SEQ ID NO:194) from -962 to -865, and
(SEQ ID NO:195) from -872 to -865.

Examples of portions of SEQ ID NOs:85 and 92 which contain only the second PEA-3
element include, but are not limited to, the nucleotide sequence (SEQ ID NO:196) from
-863 to -83, (SEQ ID NO:197) from -853 to -83, (SEQ ID NO:198) from -843 to -83, (SEQ
ID NO:199) from -833 to -83, (SEQ ID NO:200) from -832 to -83, (SEQ ID NO:201) from
-863 to -183, (SEQ ID NO:202) from -863 to -283, (SEQ ID NO:203) from -863 to -383,
(SEQ ID NO:204) from -863 to -483, (SEQ ID NO:205) from -863 to -583, (SEQ ID
NO:206) from -863 to -683, (SEQ ID NO:207) from -863 to -783, and (SEQ ID NO:208)
from -863 to -826.
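
The grouping of the exemplary portions above follows from whether a given coordinate range fully spans the first PEA-3 element (-871 to -865), the second PEA-3 element (-832 to -826), or both. As a minimal sketch only, this can be checked in Python; the function names are hypothetical, and coordinates are the negative nucleotide positions of the hPC gene used in the text.

```python
# Positions of the two PEA-3 elements in the hPC gene, as stated in the text.
PEA3_FIRST = (-871, -865)
PEA3_SECOND = (-832, -826)


def spans(portion, element):
    """True if the portion (start, end) fully contains the element (start, end)."""
    start, end = portion
    return start <= element[0] and element[1] <= end


def pea3_content(portion):
    """Classify a portion by the PEA-3 elements it fully contains."""
    has_first = spans(portion, PEA3_FIRST)
    has_second = spans(portion, PEA3_SECOND)
    if has_first and has_second:
        return "both"
    if has_first:
        return "first only"
    if has_second:
        return "second only"
    return "neither"
```

For example, `pea3_content((-1462, -802))` returns "both", `pea3_content((-1462, -849))` returns "first only", and `pea3_content((-863, -83))` returns "second only".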

In a particularly preferred embodiment, the portion of SEQ ID NOs:85 and 92 is 
selected from the first PEA-3 element (SEQ ID NO:89) and the second PEA-3 element (SEQ 
ID NO:90). It is the inventors' view that the first and/or second PEA-3 elements within
SEQ ID NOs:85 and 92 are responsible for the observed age-related regulatory activity and 
regulatory activity of SEQ ID NOs:85 and 92. 



B. Using Probes To Identify And Isolate Homologs Of AE5', AE3', and Of
hPC-Derived Regulatory Sequences

The invention provided herein is not limited to SEQ ID NOs:1, 3, 85 and 92, and
homologs and portions thereof having age-related regulatory activity, but includes sequences
having no age-related regulatory activity (i.e., non-functional homologs and non-functional
portions of homologs). The use of such sequences may be desirable, for example, where a
portion of SEQ ID NOs:1, 3, 85 and 92 is used as a probe to detect the presence of SEQ ID
NOs:1, 3, 85 and 92, respectively, or of portions thereof in a sample.

As used herein, the term "probe" refers to an oligonucleotide, whether occurring
naturally as in a purified restriction digest or produced synthetically, recombinantly or by
PCR amplification, which is capable of hybridizing to a nucleotide sequence of interest. A
probe may be single-stranded or double-stranded. It is contemplated that any probe used in
the present invention will be labelled with any "reporter molecule," so that it is detectable in
any detection system including, but not limited to, enzyme (e.g., ELISA, as well as
enzyme-based histochemical assays), fluorescent, radioactive, calorimetric, gravimetric,
magnetic, and luminescent systems. It is not intended that the present invention be limited
to any particular detection system or label.

The probes provided herein are useful in the detection, identification and isolation of,
for example, sequences such as those listed as SEQ ID NOs:1, 3, 85 and 92 as well as of
homologs thereof. Preferred probes are of sufficient length (e.g., from about 9 nucleotides
to about 20 nucleotides or more in length) such that high stringency hybridization may be
employed. In one embodiment, probes from 20 to 50 nucleotide bases in length are
employed.
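
By way of a non-limiting sketch, candidate single-stranded probes of a chosen length can be enumerated from a target strand by taking the reverse complement of each window of that length. The Python code below assumes a plain DNA alphabet (A, C, G, T); the function names are hypothetical, and the example target is simply the first 30 nucleotides of SEQ ID NO:85 as printed above, chosen for illustration.

```python
# Translation table for complementing DNA bases (upper or lower case).
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def candidate_probes(target, length):
    """Return every probe of the given length that is complementary to the target."""
    if length < 1 or length > len(target):
        raise ValueError("probe length must be between 1 and the target length")
    return [
        reverse_complement(target[i:i + length])
        for i in range(len(target) - length + 1)
    ]


# Illustrative target: first 30 nt of SEQ ID NO:85 (whitespace removed).
TARGET = "GAATTCTGTAAGCATTTCCTATGTGTACCT"
probes_20mer = candidate_probes(TARGET, 20)
```

Selecting among such candidates (for example, on the basis of hybridization stringency) is outside the scope of this sketch.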



C. Using Primers to Amplify At Least A Portion Of AE5', AE3', and Of
hPC-Derived Regulatory Sequences

The invention provided herein is not limited to SEQ ID NOs:1 and 3, and homologs and
portions thereof having age-related regulatory activity, but includes sequences having no
age-related regulatory activity. This may be desirable, for example, where a portion of the
nucleic acid sequences set forth as SEQ ID NOs:1 and 3 is used as a primer for the
amplification of nucleic acid sequences by, for example, polymerase chain reactions (PCR)
or reverse transcription-polymerase chain reactions (RT-PCR). The term "amplification" is
defined as the production of additional copies of a nucleic acid sequence and is generally
carried out using polymerase chain reaction technologies well known in the art [Dieffenbach
CW and GS Dveksler (1995) PCR Primer, a Laboratory Manual, Cold Spring Harbor Press,
Plainview NY]. As used herein, the term "polymerase chain reaction" ("PCR") refers to the
method of K.B. Mullis disclosed in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,965,188, all
of which are hereby incorporated by reference, which describe a method for increasing the
concentration of a segment of a target sequence in a mixture of genomic DNA without
cloning or purification. This process for amplifying the target sequence consists of
introducing a large excess of two oligonucleotide primers to the DNA mixture containing the
desired target sequence, followed by a precise sequence of thermal cycling in the presence of
a DNA polymerase. The two primers are complementary to their respective strands of the
double stranded target sequence. To effect amplification, the mixture is denatured and the
primers are then annealed to their complementary sequences within the target molecule.
Following annealing, the primers are extended with a polymerase so as to form a new pair of
complementary strands. The steps of denaturation, primer annealing and polymerase
extension can be repeated many times (i.e., denaturation, annealing and extension constitute
one "cycle"; there can be numerous "cycles") to obtain a high concentration of an amplified
segment of the desired target sequence. The length of the amplified segment of the desired
target sequence is determined by the relative positions of the primers with respect to each
other, and therefore, this length is a controllable parameter. By virtue of the repeating aspect
of the process, the method is referred to as the "polymerase chain reaction" (hereinafter
"PCR"). Because the desired amplified segments of the target sequence become the
predominant sequences (in terms of concentration) in the mixture, they are said to be "PCR
amplified."
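
The dependence of amplicon length on primer position, and the effect of repeated cycling, can be illustrated with a short Python sketch. This is a simplified model and not a description of the patented method: primers are assumed to match the template exactly, each cycle is assumed to double the number of copies (an idealized upper bound), and all function names are hypothetical. The template is the first 60 nucleotides of SEQ ID NO:85 as printed above, and the two 10-nt primers were chosen for illustration only.

```python
COMPLEMENT = str.maketrans("acgtACGT", "tgcaTGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def amplicon(template, forward, reverse):
    """Return the segment bounded by the two primers, or None if either has no exact site.

    `template` is the top strand (5' to 3'); `forward` matches the top strand and
    `reverse` matches the bottom strand, so its site on the top strand is its
    reverse complement.
    """
    start = template.find(forward)
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    site = template.find(reverse_site, start + len(forward))
    if site == -1:
        return None
    return template[start:site + len(reverse_site)]


def ideal_copies(initial_copies, cycles):
    """Copies after the given number of cycles, assuming perfect doubling per cycle."""
    return initial_copies * 2 ** cycles


# Illustrative template: first 60 nt of SEQ ID NO:85 (whitespace removed).
TEMPLATE = "GAATTCTGTAAGCATTTCCTATGTGTACCTGCCCCTGGGCAAGGTGGGCCTGACTTGTTA"
FORWARD = "AGCATTTCCT"   # matches template positions 11-20
REVERSE = "GGCCCACCTT"   # reverse complement of template positions 41-50

product = amplicon(TEMPLATE, FORWARD, REVERSE)
```

Moving either primer along the template changes the length of `product`, which is the sense in which the amplicon length is a controllable parameter.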

With PCR, it is possible to amplify a single copy of a specific target sequence in
genomic DNA to a level detectable by several different methodologies (e.g., hybridization
with a labeled probe; incorporation of biotinylated primers followed by avidin-enzyme
conjugate detection; and/or incorporation of 32P-labeled deoxyribonucleotide triphosphates,
such as dCTP or dATP, into the amplified segment). In addition to genomic DNA, any
nucleotide sequence can be amplified with the appropriate set of primer molecules. In
particular, the amplified segments created by the PCR process are themselves efficient
templates for subsequent PCR amplifications. Amplified target sequences may be used to
obtain segments of DNA (e.g., genes) for the construction of targeting vectors, transgenes,
etc.

As used herein, the term "primer" refers to an oligonucleotide, whether occurring
naturally as in a purified restriction digest or produced synthetically, which is capable of
acting as a point of initiation of synthesis when placed under conditions in which synthesis
of a primer extension product which is complementary to a nucleic acid strand is induced
(i.e., in the presence of nucleotides and an inducing agent such as DNA polymerase and at a
suitable temperature and pH). The primer is preferably single stranded for maximum
efficiency in amplification, but may alternatively be double stranded. If double stranded, the
primer is first treated to separate its strands before being used to prepare extension products.
Preferably, the primer is an oligodeoxyribonucleotide. The primer must be sufficiently long
(e.g., from about 9 nucleotides to about 20 nucleotides or more in length) to prime the
synthesis of extension products in the presence of the inducing agent. Suitable lengths of the
primers may be empirically determined and depend on factors such as temperature, source of
primer and the use of the method. In one embodiment, the present invention employs
primers from 20 to 50 nucleotide bases in length.

The primers contemplated by the invention are useful in, for example, identifying
sequences which are homologous to AE5', AE3', and regulatory sequences derived from
hPC, in mammals, yeast, bacteria, and in other organisms.






wo 00/75279 




D. Methods For Regulating Gene Expression 

The present invention provides methods for regulating expression of a nucleotide
sequence of interest over a period of time in a cell or multicellular organism. Specifically,
gene expression is preferably regulated in a multicellular organism. In one embodiment,
expression of a nucleotide sequence of interest is stabilized such that the level of mRNA
and/or protein encoded by the nucleotide sequence of interest remains relatively unchanged
at different times during the life of the organism. In an alternative embodiment, expression
of a nucleotide sequence of interest is increased. Increased expression means that the level
of mRNA and/or protein encoded by the nucleotide sequence of interest at a given time point
is greater than the level of mRNA and/or protein, respectively, at an earlier time point during
the life of the organism or cell. Alternatively, increased expression means that the level of
mRNA and/or protein encoded by the nucleotide sequence of interest is greater than the level
of mRNA and/or protein, respectively, observed at the same time point in the life of the
organism or cell when the nucleotide sequence of interest is expressed in the absence of the
sequences of the invention.
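
The two senses of "increased expression" and the notion of "stabilized" expression described above can be restated as simple comparisons. The Python sketch below is illustrative only: the function names are hypothetical, and the tolerance used to decide whether levels are "relatively unchanged" is an arbitrary assumption, since the text does not fix a numerical threshold.

```python
def increased_over_time(level_later, level_earlier):
    """First sense: the level at a given time point exceeds the level at an earlier one."""
    return level_later > level_earlier


def increased_versus_control(level_with, level_without):
    """Second sense: at the same time point, the level with the regulatory
    sequences exceeds the level without them."""
    return level_with > level_without


def stabilized(levels, tolerance=0.5):
    """True if every level lies within +/- tolerance (as a fraction) of the mean.

    The default tolerance of 0.5 is an assumed, illustrative threshold only.
    """
    mean = sum(levels) / len(levels)
    return all(abs(level - mean) <= tolerance * mean for level in levels)
```

With hypothetical measurements, `stabilized([1000, 1100, 900])` is true, whereas `stabilized([40, 10, 0])` is false because the levels decline sharply over time.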

In one embodiment, regulating expression of a nucleotide sequence of interest over a
period of time is accomplished by introducing into a host cell a vector that contains a
nucleotide sequence of interest operably linked to a promoter sequence and to sequences
provided herein which have age-related regulatory activity. The transfected host cell is
allowed to develop into a transgenic animal in which the nucleotide sequence of interest is
expressed in at least one tissue. These steps are further described below for specific
embodiments.



1. Expression Constructs 

In one embodiment of the methods of the invention for regulating expression of a
nucleotide sequence of interest in an age-related and/or liver-specific manner, a vector is
constructed in which the nucleic acid sequences of the invention (e.g., AE5' alone, AE3'
alone, or a combination of AE5' and AE3') are operably linked to a promoter sequence and
to a nucleotide sequence of interest. In one embodiment, the nucleotide sequence of interest
is the coding region of the hFIX gene (Example 1). In another embodiment the nucleotide
sequence of interest is the coding region of the protein C gene (Example 7).

The invention is not limited to coding sequences of the hFIX gene or protein C gene.
Rather, any nucleic acid sequence whose expression is desired to be regulated by
sequences provided herein is contemplated to be within the scope of this invention. Such
nucleotide sequences include, but are not limited to, coding sequences of structural genes



gene, etc.]. In one preferred embodiment, the structural gene is the human α1-antitrypsin
gene (Figure 10) (SEQ ID NO:42) which encodes a plasma proteinase inhibitor used for
treating emphysema. In another preferred embodiment, the structural gene is one encoding
the human antithrombin III (Figure 11) (SEQ ID NO:43) which is a plasma proteinase
inhibitor for activated blood coagulation factors and whose activity is increased by heparin.
In yet another preferred embodiment, the structural gene is the gene encoding the PEA-3
protein (Figure 9) (SEQ ID NO:47) and/or its related protein, which has been shown to bind
specifically to homologs of the PEA-3 nucleotide sequence (SEQ ID NO:2) disclosed herein.

15 The invention is not limited to using a single nucleotide sequence of interest in 

operable combination with the sequences of the invention. Rather, a plurality (i.e., more 
than one) of nucleotide sequences of interest may be ligated in tandem such that their 
expression is regulated by the regulatory sequences of the invention, A plurality of coding 
sequences may be desirable, for example, where it is useful to express a transcription product 

20 of more than one gene to permit interaction of these transcriptional products. Alternatively, 
a plurality of coding sequences may be desirable where one of the gene sequences is a 
reporter gene sequence. For example, it may be advantageous to place a coding sequence of 
a reporter gene, in tandem with the coding sequence of a gene of interest such that expression 
of the coding region of both the reporter gene and the gene of interest is regulated by the 

25 sequences of the invention. Expression of the reporter gene usually correlates with 
expression of the gene of interest. Examples of reporter gene sequences include the 
sequences encoding the enzymes P-galactosidase and luciferase. Fusion genes may also be 
desirable to facilitate purification of the expressed protein. For example, the heterologous 
sequence which encodes protein A allows purification of the fusion protein on immobilized 

30 immunoglobulin. Other affinity traps are well known in the art and can be utilized to 

advantage in purifying the expressed fusion protein. For example, pGEX vectors (Promega, 
Madison WI) may be used to express the polypeptides of interest as a fusion protein with 



5 



which encode a protein [e.g,, reporter genes, selection marker genes, oncogenes, drug 
resistance genes, growth factor genes, activator protein 1 gene, activator protein 2 gene, Spl 
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glutathione S-transferase (GST). In general, such fusion proteins are soluble and can easily 
be purified from lysed cells by adsorption to glutathione-agarose beads followed by elution 
in the presence of free glutathione. Other fusion polypeptides useful in the purification of 
proteins of interest are commercially available, including histidine tails (which bind to Ni^**"), 
5 biotin (which binds to streptavidin), and maltose-binding protein (MBP) (which binds to 

amylose). Proteins made in such systems may be designed to include heparin, thrombin or 
factor XA protease cleavage sites so that the cloned polypeptide of interest can be released at 
will from the heterologous polypeptide moiety to which it is fused. 

One of skill in the art would understand that where a plurality of nucleotide
sequences of interest is operably linked to sequences of the present invention, the nucleotide
sequences of interest may be either contiguous or separated by intervening polynucleotide 
sequences, so long as the nucleic acid sequences of interest are operably linked to the 
promoter sequence, and so long as the sequences of the invention are operably linked to the 
promoter sequence. 

While specific preferred embodiments used herein disclose the use of the hFIX
promoter and the CMV promoter, it is not intended that the invention be limited to the type
or source of the promoter sequence which is operably linked to the sequences of the
invention. Any promoter whose activity is desired to be regulated by the sequences provided
herein is contemplated to be within the scope of the invention. Exemplary promoters include
the tRNA promoter, 5S rRNA promoters, histone gene promoters, RSV promoter (can be
isolated from vector plasmid pRc/RSV from Invitrogen), retrovirus LTR promoter (can be
isolated from vector plasmid pLXSN from Clontech), SV40 promoter (located between
positions +3530 to +3192 in vector plasmid pCR3 from Invitrogen), PEPCK promoter, MT
promoter, SRα promoter, P450 family promoters, GAL7 promoter, T7 promoter having the
23-bp sequence (SEQ ID NO:44) 5'-TAATACGACTCACTATAGGGCGA-3', T3 promoter
having the 24-bp sequence (SEQ ID NO:45) 5'-TTATTAACCCTCACTAAAGGGAAG-3',
SP6 promoter having the 23-bp sequence (SEQ ID NO:46)
5'-ATTTAGGTGACACTATAGAATAC-3', and K11 promoter. The T7 promoter, T3
promoter, SP6 promoter and K11 promoter have been described in U.S. Patent No.
5,591,601, the entire contents of which are incorporated by reference.
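
By way of illustration only, the three phage promoter sequences recited above can be held in a small table and checked against their stated lengths. The following Python sketch is not part of the disclosed methods; the function and variable names are hypothetical, and the pairing of the names T7 and T3 with SEQ ID NO:44 and SEQ ID NO:45 follows the reading of the passage above.

```python
# Promoter sequences as recited in the text (5' to 3'), with their stated lengths.
# Names of the table and function are illustrative assumptions.
PHAGE_PROMOTERS = {
    "T7": ("SEQ ID NO:44", "TAATACGACTCACTATAGGGCGA", 23),
    "T3": ("SEQ ID NO:45", "TTATTAACCCTCACTAAAGGGAAG", 24),
    "SP6": ("SEQ ID NO:46", "ATTTAGGTGACACTATAGAATAC", 23),
}


def check_promoters(promoters=PHAGE_PROMOTERS):
    """Return {name: bool}; True when the sequence uses only A, C, G, T
    and its length equals the length stated in the text."""
    results = {}
    for name, (_seq_id, seq, stated_length) in promoters.items():
        only_dna_bases = set(seq) <= set("ACGT")
        results[name] = only_dna_bases and len(seq) == stated_length
    return results


if __name__ == "__main__":
    for name, ok in check_promoters().items():
        print(name, "consistent with stated length" if ok else "MISMATCH")
```

A check of this kind is useful mainly for catching transcription errors when promoter sequences are copied into a construct design.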

Nor is the invention intended to be limited to the use of a single promoter. For
example, chimeric promoters (i.e., two or more promoters which are derived from at least
one gene) are expressly contemplated to be within the scope of the invention. Such chimeric
promoters may be desirable where, for example, chimeric promoters result in increased levels
of expression of an operably linked downstream coding sequence. Chimeric promoters are
known in the art and include, for example, the double Tet promoter [Kistner et al. (1996)
Proc. Natl. Acad. Sci. USA 93:10933-10938], and the U1 snRNA promoter-CMV
promoter/enhancer [Bartlett et al. (1996) Proc. Natl. Acad. Sci. USA 93:8852-8857].

Expression vectors in which expression of a nucleic acid sequence of interest is
regulated by sequences of the invention may be constructed using the teachings of the
present invention in conjunction with techniques well known in the art [Sambrook et al.
(1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Plainview NY;
Ausubel et al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New
York NY]. Briefly, the nucleic acid sequence of interest is placed in operable combination
with a promoter sequence and sequences of the invention in the presence of transcription and
translation regulatory sequences, including initiation signals such as a start codon (i.e.,
ATG), enhancers, and transcription termination signals. The ATG initiation codon must be
in the correct reading frame to ensure translation of the entire heterologous nucleotide
sequence. Transcription termination signals are placed downstream of the heterologous
nucleic acid sequence and include polyadenylation sequences which are exemplified by, but
not limited to, SV40 poly-A sequence, hINV poly-A sequence, or bovine growth hormone
poly-A sequence, etc.
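
The reading-frame requirement stated above can be illustrated with a short check on a candidate coding sequence. The Python sketch below is an illustration under stated assumptions, not a disclosed procedure: it assumes the input is the coding region alone (start codon through stop codon, no introns), uses the standard stop codons TAA, TAG and TGA, and the function name is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code


def check_reading_frame(coding_seq):
    """Return a list of problems found in a coding sequence.
    An empty list means: starts with ATG, length is a multiple of three,
    no in-frame stop codon before the last codon, and ends with a stop codon."""
    seq = coding_seq.upper()
    problems = []
    if not seq.startswith("ATG"):
        problems.append("does not start with ATG")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    # Split into complete codons, reading in frame from the first base.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            problems.append(f"premature stop codon {codon} at codon {index + 1}")
            break
    if codons and codons[-1] not in STOP_CODONS:
        problems.append("no stop codon at the end")
    return problems


if __name__ == "__main__":
    print(check_reading_frame("ATGGCCTAA"))
    print(check_reading_frame("ATGTAAGCCTGA"))
```

A premature in-frame stop codon or a length that is not a multiple of three would indicate that the heterologous sequence would not be translated in its entirety from the ATG.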

Other regulatory sequences which may affect RNA stability as well as enhancers (i.e.,
a sequence which when activated results in an increase in the basal rate of transcription of a
gene) and silencers (i.e., a sequence involved in reducing expression of a gene) may also be
included. These regulatory sequences may be relatively position-insensitive, i.e., the
regulatory element will function correctly even if positioned differently in relation to the
heterologous nucleotide sequence in the construct as compared to its position in relation to
the corresponding heterologous nucleotide sequence in the genome. For example, an
enhancer may be located at different distances from the promoter sequence, in a different
orientation, and/or in a different linear order. Thus, an enhancer that is located 3' to a
promoter sequence in germline configuration might be located 5' to the promoter sequence
in the construct.

2. Host Cells

Host cells are transformed with expression vectors which contain the sequences of the
invention in operable combination with a nucleic acid sequence of interest using methods
known in the art. The term "transformation" as used herein refers to the introduction of a
transgene into a cell.

The term "transgene" as used herein refers to any nucleic acid sequence which is
introduced into the genome of a cell by experimental manipulations. A transgene may be an
"endogenous DNA sequence," or a "heterologous DNA sequence." The term "endogenous
DNA sequence" refers to a nucleotide sequence which is naturally found in the cell into
which it is introduced so long as it does not contain some modification (e.g., a point
mutation, the presence of a selectable marker gene, etc.) relative to the naturally-occurring
sequence. The terms "heterologous DNA sequence" and "foreign DNA sequence" refer to a
nucleotide sequence which is ligated to, or is manipulated to become ligated to, a nucleic
acid sequence to which it is not ligated in nature, or to which it is ligated at a different
location in nature. Heterologous DNA is not endogenous to the cell into which it is
introduced, but has been obtained from another cell. Heterologous DNA also includes an
endogenous DNA sequence which contains some modification (e.g., a point mutation, the
presence of a selectable marker gene, etc.) relative to the naturally-occurring gene.
Generally, although not necessarily, heterologous DNA encodes RNA and proteins that are
not normally produced by the cell in which it is expressed. Examples of heterologous
DNA include reporter genes, transcriptional and translational regulatory sequences, selectable
marker proteins (e.g., proteins which confer drug resistance), etc.

Transformation may be accomplished by a variety of means known to the art
including calcium phosphate-DNA co-precipitation, DEAE-dextran-mediated transfection,
polybrene-mediated transfection, electroporation, microinjection, liposome fusion, lipofection,
protoplast fusion, retroviral infection, biolistics (i.e., particle bombardment) and the like.

Transformation of a cell may be stable or transient. The term "transient
transformation" or "transiently transformed" refers to the introduction of one or more
transgenes into a cell in the absence of integration of the transgene into the host cell's
genome. Transient transformation may be detected by, for example, enzyme-linked
immunosorbent assay (ELISA) which detects the presence of a polypeptide encoded by one
or more of the transgenes. Alternatively, transient transformation may be detected by
detecting the activity of the protein encoded by the transgene. For example, the activity of
β-glucuronidase (GUS) which is encoded by the uidA gene may be detected using either a
histochemical assay of GUS enzyme activity by staining with X-gluc which gives a blue
precipitate in the presence of the GUS enzyme, or a chemiluminescent assay using the
GUS-Light kit (Tropix). The term "transient transformant" refers to a cell which has
transiently incorporated one or more transgenes. In contrast, the term "stable transformation"
or "stably transformed" refers to the introduction and integration of one or more transgenes
into the genome of a cell. Stable transformation of a cell may be detected by Southern blot
hybridization of genomic DNA of the cell with nucleic acid sequences which are capable of
binding to one or more of the transgenes. Alternatively, stable transformation of a cell may
also be detected by the polymerase chain reaction (PCR) of genomic DNA of the cell to
amplify transgene sequences. The term "stable transformant" refers to a cell which has
stably integrated one or more transgenes into the genomic DNA. Thus, a stable transformant
is distinguished from a transient transformant in that, whereas genomic DNA from the stable
transformant contains one or more transgenes, genomic DNA from the transient transformant
does not contain a transgene.
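
The distinction drawn above between stable and transient transformants reduces to two observations: whether transgene sequences are detected in genomic DNA (e.g., by Southern blot or PCR), and whether the encoded product is detected (e.g., by ELISA or GUS assay). The Python sketch below merely restates that decision rule; the function name and return labels are hypothetical, and real assay interpretation would also require appropriate controls.

```python
def classify_transformant(transgene_in_genomic_dna, product_detected):
    """Classify a cell from two boolean assay outcomes.

    transgene_in_genomic_dna: transgene detected in genomic DNA
                              (e.g., Southern blot or PCR of genomic DNA).
    product_detected:         encoded polypeptide or its activity detected
                              (e.g., ELISA or GUS assay).
    """
    if transgene_in_genomic_dna:
        # Integration into the genome defines a stable transformant.
        return "stable transformant"
    if product_detected:
        # Product present without integration defines a transient transformant.
        return "transient transformant"
    return "no evidence of transformation"


if __name__ == "__main__":
    print(classify_transformant(True, True))
    print(classify_transformant(False, True))
    print(classify_transformant(False, False))
```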

Suitable host cells include bacterial, yeast, plant, insect, and mammalian cells. In one
embodiment the host cell is mammalian. In a preferred embodiment, the mammalian host
cell is a mouse fertilized egg cell. In an alternative embodiment, the mammalian host cell is
a HepG2 cell (ATCC number HB8065), a fibroblast cell (e.g., ATCC number CCL 110), a
myoblast cell (e.g., Clonetics, catalog # SkMC), or an endothelial cell (e.g., human
umbilical cord endothelial cells; ATCC number CRL 1730).

In one embodiment, the host cell is transformed both with an expression vector which
contains the sequences of the invention in operable combination with the nucleic acid
sequences of interest, as well as with an expression vector which expresses the PEA-3
protein (Example 6). Such co-transformation may be desirable, for example, where
expression of the nucleotide sequence of interest is regulated by AE5' or portions or
homologs thereof which contain homologs of the PEA-3 nucleotide sequence to which the
PEA-3 protein binds. In one embodiment, expression of the PEA-3 protein is under the
control of the LTR promoter of the Moloney murine leukemia virus (MoLV) which is
capable of driving expression of operably linked genes in several cell types. Transient
expression assays are suitable for determining the relative promoter activities in expressing
desirable PEA-3 protein levels.

WO 00/75279    PCT/US00/15728

Any number of selection systems may be used to recover transfected cells. These
include, but are not limited to, the herpes simplex virus thymidine kinase (Wigler M et al.
(1977) Cell 11:223-32) and adenine phosphoribosyltransferase (Lowy I et al. (1980) Cell
22:817-23) genes which can be employed in tk⁻ or aprt⁻ cells, respectively. Also,
antimetabolite, antibiotic or herbicide resistance can be used as the basis for selection; for
example, dhfr, which confers resistance to methotrexate [Wigler M et al. (1980) Proc Natl
Acad Sci 77:3567-70]; npt, which confers resistance to the aminoglycosides neomycin and
G-418 [Colbere-Garapin F et al. (1981) J. Mol. Biol. 150:1-14]; and als or pat, which confer
resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra).
Additional selectable genes have been described, for example, trpB, which allows cells to
utilize indole in place of tryptophan, or hisD, which allows cells to utilize histidinol in place
of histidine [Hartman SC and RC Mulligan (1988) Proc Natl Acad Sci 85:8047-51].
Recently, the use of a reporter gene system which expresses visible markers has gained
popularity with such markers as β-glucuronidase and its substrate (GUS), luciferase and its
substrate (luciferin), and β-galactosidase and its substrate (X-Gal) being widely used not only
to identify transformants, but also to quantify the amount of transient or stable protein
expression attributable to a specific vector system [Rhodes CA et al. (1995) Methods Mol
Biol 55:121-131].

The presence or expression of the reporter gene usually indicates the presence or
expression, respectively, of the tandem heterologous nucleic acid sequence as well.
However, it is preferred that the presence and expression of the desired heterologous nucleic
acid sequence be confirmed. This is accomplished by procedures known in the art which
include DNA-DNA or DNA-RNA hybridization or amplification using probes or fragments
of the heterologous nucleic acid sequence. For example, Fluorescent In Situ Hybridization
(FISH) can be used to detect the heterologous nucleic acid sequence in cells. Several guides
to FISH techniques are available, e.g., Gall et al. Meth. Enzymol. 21:470-480 (1981);
Angerer et al., in "Genetic Engineering: Principles and Methods," Setlow & Hollaender,
Eds. Vol. 7 pp. 43-65, Plenum Press, New York (1985). Alternatively, DNA or RNA can be
isolated from cells for detection of the transgene by Southern or Northern hybridization or
by amplification based assays. Nucleic acid amplification based assays involve the use of
oligonucleotides or oligomers based on the nucleotide sequence of interest in order to detect
cells and tissues which contain the DNA or RNA encoding the transgene of interest. As
used herein, the terms "oligonucleotides" and "oligomers" refer to a nucleic acid sequence of
at least about five (5) contiguous nucleotide residues and as many as about sixty (60)
nucleotides, preferably about 15 to 30 nucleotides, and more preferably about 20-25
nucleotides, which can be used as a probe or amplimer. Standard PCR methods useful in the
present invention are described by Innis et al. (Eds.), "PCR Protocols: A Guide to Methods
and Applications," Academic Press, San Diego (1990).

Yet another alternative for the detection of heterologous nucleic acid sequences
includes detecting the polypeptide product of transcription of the heterologous nucleotide
sequence. A variety of protocols which employ polyclonal or monoclonal antibodies specific
for the protein product are known in the art. Examples include enzyme-linked
immunosorbent assay (ELISA), radioimmunoassay (RIA) and fluorescent activated cell
sorting (FACS). A competitive binding assay may also be used. Alternatively, a two-site,
monoclonal-based immunoassay which utilizes monoclonal antibodies that are reactive to two
non-interfering epitopes on the protein of interest may be employed. These and other assays
are described in, among other places, Hampton R et al. (1990), Serological Methods: A
Laboratory Manual, APS Press, St Paul MN, and Maddox DE et al. (1983), J. Exp. Med.
158:1211.

A wide variety of labels and conjugation techniques are known by those skilled in the
art and can be used in various nucleic acid and amino acid assays. Means for producing
labeled hybridization or PCR probes for detecting related sequences include oligolabeling,
nick translation, end-labeling or PCR amplification using a labeled nucleotide. Alternatively,
the nucleotide sequence of interest, or any portion of it, may be cloned into a vector for the
production of an mRNA probe. Such vectors are known in the art, are commercially
available, and may be used to synthesize RNA probes in vitro by addition of an appropriate
RNA polymerase such as T7, T3 or SP6 and labeled nucleotides. A number of companies
such as Pharmacia Biotech (Piscataway NJ), Promega (Madison WI), and US Biochemical
Corp (Cleveland OH) supply commercial kits and protocols for these procedures. Suitable
reporter molecules or labels include those radionuclides, enzymes, fluorescent,
chemiluminescent, or chromogenic agents as well as substrates, cofactors, inhibitors,
magnetic particles and the like.

Host cells transformed with expression vectors containing the sequences provided
herein are useful for age-related expression of recombinant proteins of interest. Host cells
transformed with expression vectors containing the invention's sequences may be part of a
tissue or organ of a living animal. A "living animal" as used herein refers to any
multicellular animal (e.g., humans, non-human primates, ovines, bovines, ruminants,
lagomorphs, porcines, caprines, equines, canines, felines, aves, etc.) into whose cells the
sequences provided herein may be introduced. Where the host cells (e.g., fertilized egg
cells) are capable of generating a multicellular organism, these cells when transformed with
expression vectors containing the sequences of the invention are useful in generating
transgenic animals which exhibit age-related and/or liver-specific expression of nucleotide
sequences of interest.
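
For illustration, the length ranges given in this section for oligonucleotides used as probes or amplimers (about 5 to about 60 nucleotides, preferably about 15 to 30, more preferably about 20-25) can be expressed as a simple classifier. The Python sketch below is an assumption-laden restatement: the text qualifies each bound with "about", whereas the sketch treats the bounds as exact integers, and the function name and labels are hypothetical.

```python
def classify_oligo_length(length):
    """Map an oligonucleotide length (in nucleotides) to the range it falls in.
    Bounds are treated as exact here; the text states them as approximate."""
    if 20 <= length <= 25:
        return "more preferred"
    if 15 <= length <= 30:
        return "preferred"
    if 5 <= length <= 60:
        return "within range"
    return "outside range"


if __name__ == "__main__":
    for n in (4, 5, 18, 22, 45, 61):
        print(n, classify_oligo_length(n))
```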

3. Transgenic Animals 

The present invention provides transgenic non-human animals which express a
nucleotide sequence of interest in an age-related manner. These animals provide useful
models for diseases (e.g., thrombosis, cardiovascular diseases, diabetes, Alzheimer's disease,
cancer, osteoporosis, osteoarthritis, Parkinson's disease, dementia) which are associated with
increasing age, as well as for screening candidate therapeutic agents against such diseases.
These transgenic animals are also useful in studies of normal phenomena, such as ageing,
gene regulation, etc. In one embodiment, the invention discloses transgenic mice which
express in an age-related manner the exemplary hFIX coding sequence under the control of
AE5' and/or AE3' (Example 3).

The term "age-related manner" when made in reference to the expression of a
nucleotide sequence of interest is a relative term which refers to an increase over a period of
time in the quantity of mRNA and/or protein encoded by the nucleotide sequence of interest
when the nucleotide sequence of interest is operably linked to a promoter and to a nucleic
acid sequence which has age-related regulatory activity, as compared to the quantity of
mRNA and/or protein, respectively, encoded by the nucleotide sequence of interest when the
nucleotide sequence of interest is operably linked to the promoter in the absence of the
nucleic acid sequence which has age-related regulatory activity. Thus, the term "age-related"
when made in reference to expression of a nucleotide sequence of interest by a transgenic
animal means that the transgenic animal expresses the nucleotide sequence of interest in an
age-related manner.
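
Because "age-related manner" is defined as a relative term, the underlying comparison can be sketched as follows: the change in expression level over a period of time for a construct carrying the age-related regulatory sequence is compared with the change over the same period for the corresponding construct lacking it. The Python sketch below is only an illustration. The choice of end-point difference as the measure of change is an assumption (the text does not prescribe a statistic), the function names are hypothetical, and the numerical values are invented for the example and are not data from the disclosure.

```python
def level_change(series):
    """Change in level from the earliest to the latest time point.
    series: list of (age_in_months, level) pairs."""
    ordered = sorted(series)
    return ordered[-1][1] - ordered[0][1]


def is_age_related(with_element, without_element):
    """True when the level changes more favorably over time with the
    regulatory element than without it (a larger increase, or a smaller
    decrease, over the same period)."""
    return level_change(with_element) > level_change(without_element)


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    with_element = [(1, 100.0), (9, 80.0)]     # declines by 20
    without_element = [(1, 100.0), (9, 30.0)]  # declines by 70
    print(is_age_related(with_element, without_element))
```

Under this reading, a construct whose expression declines with age can still qualify as age-related, provided the decline is smaller than that of the control construct over the same period.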

For example, in one embodiment, the invention demonstrates that hFIX is expressed
in an age-related manner in transgenic mice which harbor a transgene (-416FIXml/1.4)
(Figure 2B) which contains hFIX under the control of the hFIX promoter and the regulatory
control of AE3' as compared to expression of hFIX in transgenic mice which harbor a
transgene (-416FIXml) (Figure 2A) in which hFIX is under the control of the hFIX
promoter in the absence of AE3'. While transgenic mice harboring the -416FIXml/1.4
construct showed decreasing hFIX activity levels over a period of time (e.g., from 1 to 9
months of age), this decrease was less than the decrease in hFIX activity levels which was
observed in transgenic mice harboring the -416FIXml construct over the same period of
time.

In another embodiment, the invention discloses that hFIX is expressed in an
age-related manner in transgenic mice which harbor transgenes (-802FIXml, -2231FIXml, and
-416FIXml/AE5') (Figures 4A, C and E) each of which contains hFIX under the control of
the hFIX promoter and the regulatory control of AE5' as compared to expression of hFIX in
transgenic mice which harbor transgenes (-416FIXml and -770FIXml) (Figures 2A and E)
in which hFIX is under the control of the hFIX promoter in the absence of AE5'.
Transgenic mice harboring each of the -802FIXml, -2231FIXml, and -416FIXml/AE5'
constructs showed relatively unchanged hFIX activity levels over a period of time (e.g., from
1 to 7 months of age) while transgenic mice harboring either the -416FIXml or -770FIXml
construct showed decreasing hFIX activity levels over the same time period.

In an additional embodiment, the invention shows that hFIX is expressed in an
age-related manner in transgenic mice which harbor transgenes (-802FIXml/1.4 and
-2231FIXml/1.4) (Figures 4B and D) each of which contains hFIX under the control of the
hFIX promoter and the regulatory control of both AE3' and AE5' as compared to expression
of hFIX in transgenic mice which harbor a transgene (-770FIXml) (Figure 2E) in which
hFIX is under the control of the hFIX promoter in the absence of both AE3' and AE5'.
Transgenic mice harboring either the -802FIXml/1.4 or the -2231FIXml/1.4 construct
showed increasing levels of hFIX activity over a period of time (e.g., 1 to 3 months of age)
as compared to decreasing hFIX activity levels over the same period of time in transgenic
mice harboring the -770FIXml construct.

The present invention also provides transgenic non-human animals which express a
nucleotide sequence of interest in a liver-specific manner. These animals are useful for
targeting expression of a nucleotide sequence of interest to the liver. Examples of nucleotide
sequences of interest are those which encode blood coagulation factors (e.g., factor VIII,
factor VII, factor X and prothrombin) whose deficiency is known to play a role in abnormal
bleeding disorders. Other examples of nucleotide sequences of interest include those which
encode blood coagulation regulators and/or inhibitors (e.g., protein C, antithrombin III, and
tissue factor pathway inhibitor [TFPI]) whose deficiency results in thrombosis,
α1-antitrypsin whose deficiency results in emphysema, and LDL-receptor whose deficiency results
in hypercholesterolemia.

Yet other examples of a nucleotide sequence of interest include those encoding
enzymes involved in specific metabolic defects (e.g., urea cycle enzymes, especially
ornithine transcarbamylase, argininosuccinate synthase, and carbamyl phosphate synthase);
receptors (e.g., LDL receptor); toxins; thymidine kinase to ablate specific cells or tissues; ion
channels (e.g., chloride channel of cystic fibrosis); membrane transporters (e.g., glucose
transporter); and cytoskeletal proteins (e.g., dystrophin). The nucleotide sequence of interest
may be of synthetic, cDNA, or genomic origin, or a combination thereof. The nucleotide
sequence of interest may be one which occurs in nature, a non-naturally occurring gene
which nonetheless encodes a naturally occurring polypeptide, or a gene which encodes a
recognizable mutant of such a polypeptide. It may also encode an mRNA which will be
"antisense" to a DNA found or to an mRNA normally transcribed in the host cell, but which
antisense RNA is not itself translatable into a protein. In one embodiment, the invention
discloses transgenic mice which express in a liver-specific manner the exemplary hFIX
coding sequence under the control of AE5' (Example 3).

The term "liver-specific manner" as used herein in reference to the expression of a
nucleotide sequence of interest in a transgenic animal is a relative term which means that the
quantity of mRNA and/or protein encoded in liver tissue by the nucleotide sequence of
interest is greater than, preferably two times greater, more preferably five times greater, and
most preferably ten times greater, than the quantity of mRNA and/or protein encoded by the
nucleotide sequence of interest in tissues other than liver tissue of the same transgenic animal
as detected by Northern blot hybridization and/or by the activity of the encoded protein as
described herein. Thus, the term "liver-specific" when made in reference to expression of a
nucleotide sequence of interest by a transgenic animal means that the transgenic animal
expresses the nucleotide sequence of interest in a liver-specific manner.
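
The fold thresholds in the foregoing definition can be illustrated by comparing the level measured in liver with the levels measured in other tissues of the same animal. The Python sketch below rests on assumptions not fixed by the text: it compares liver against the highest level among the other tissues, it reads "two times greater" as a liver-to-other ratio of at least 2 (and likewise for 5 and 10), and the function name, labels and tissue names are hypothetical.

```python
def liver_specificity_tier(liver_level, other_tissue_levels):
    """Classify expression by the ratio of the liver level to the highest
    level found in any other tissue of the same animal.

    other_tissue_levels: dict mapping tissue name -> level (same units)."""
    highest_other = max(other_tissue_levels.values())
    if liver_level <= highest_other:
        return "not liver-specific"
    if highest_other == 0:
        # Detected in liver only; the ratio is unbounded.
        return "most preferred"
    ratio = liver_level / highest_other
    if ratio >= 10:
        return "most preferred"
    if ratio >= 5:
        return "more preferred"
    if ratio >= 2:
        return "preferred"
    return "liver-specific"


if __name__ == "__main__":
    # Hypothetical levels, for illustration only.
    print(liver_specificity_tier(100.0, {"kidney": 5.0, "lung": 8.0}))
    print(liver_specificity_tier(6.0, {"kidney": 5.0}))
```

Comparing against the highest non-liver tissue is the more conservative of the readings the definition permits; comparing against the mean of the other tissues would be an equally plausible alternative.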

A first step in the generation of the transgenic animals of the invention is the
introduction of a construct containing nucleic acid sequences of interest under the expression
regulatory control of sequences of the invention into target cells. Several methods are
available for accomplishing this, including microinjection, retroviral infection, and
implantation of embryonic stem cells. These methods are discussed as follows.

i. Microinjection Methods 

Direct microinjection of expression vectors into pronuclei of fertilized eggs is the
preferred, and most prevalent, technique for introducing heterologous nucleic acid sequences
into the germ line. Technical aspects of the microinjection procedure and important
parameters for optimizing integration of nucleic acid sequences have been previously
described [Hogan et al., (1986) Manipulation of the Mouse Embryo: A Laboratory Manual.
Cold Spring Harbor, New York: Cold Spring Harbor Lab.].

Once the expression vector has been injected into the fertilized egg cell, the cell is
implanted into the uterus of a pseudopregnant female and allowed to develop into an animal.
Of the founder transgenic animals born, 70% carry the expression vector sequence in all of
their cells, including the germ cells. The remaining 30% of the transgenic animals are
chimeric in somatic and germ cells because integration of the expression vector sequence
occurs after one or more rounds of replication. Heterozygous and homozygous animals can
then be produced by interbreeding founder transgenics. This method has been successful in
producing transgenic mice, sheep, pigs, rabbits and cattle [Hammer et al., (1986) J. Animal
Sci. 63:269; Hammer et al., (1985) Nature 315:680-683].

ii. Retroviral Methods
Retroviral infection of pre-implantation embryos with genetically engineered 
retroviruses may also be used to introduce transgenes into an animal cell. For example, 
blastomeres have been used as targets for retroviral infection [Jaenisch, (1976) Proc. Natl. 
30 Acad. Sci USA 73:1260-1264]. Transfection is typically achieved using a replication- 
defective retrovims carrying the transgene [Jahner et al., (1985) Proc. Natl. Acad. Sci. USA 
82:6927-6931; Van der Putten et al., (1985) Proc. Natl. Acad Sci USA 82:6148-6152]. 
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Transfection is obtained, for example, by culturing eight-cell embryos, from which the zona 
pellucida has been removed with fibroblasts which produce the virus [Van der Putten (1985), 
supra\ Stewart et aL, (1987) EMBO J. 6:383-388]. The transfected embryos are then 
transferred to foster mothers for continued development. Alternatively, infection can be 



[Jahner et aL, (1982) Nature 298:623-628]. Yet another alternative method involves 
intrauterine retroviral infection of the midgestation embryos [Jahner et aL (1982), supra]. 

The advantages of retroviral infection methods include the ease of transfection and 
the insertion of a single copy of the transgene, which is flanked by the retroviral long 

10 terminal repeats (LTRs), into the chromosome. However, this method is not a preferred 

method because most of the founders will show mosaicism since infection occurs after cell 
division has begun. This necessitates outbreeding to establish homozygous and heterozygous 
lines suitable for analysis of gene expression. More importantly, the retroviral LTR 
sequences may interfere with the activity of the hINV upstream sequences in directing 

15 expression of the heterologous nucleic aid sequences. 



Another method of introducing transgenes into the germ line involves using 
embryonic stem (ES) cells as recipients of the expression vector. ES cells are pluripotent 

20 cells directly derived from the inner cell mass of blastocysts [Doetchman et al., (1988) Dev. 
Biol. 127:224-227], from inner cell masses [Tokunaga et aL, (1989) Jpn. J. Anim. Reprod. 
35:173-178], from disaggregated morulae [Eistetter, (1989) Dev. Gro. Differ. 31:275-282] or 
from primordial germ cells [Matsui et al., (1992) Cell 70:841-847]. Expression vectors can 
be introduced into ES cells using any method which is suitable for gene transfer into cells, 

25 e.g., by transfection, cell fusion, electroporation, microinjection, DNA viruses, and RNA 
viruses [Johnson et al., (1989) Fetal Then 4 (Suppl. l):28-39]. 

The advantages of using ES cells include their ability to form permanent cell lines in
vitro, thus providing an unlimited source of genetic material. Additionally, ES cells are the
most pluripotent cultured animal cells known. For example, when ES cells are injected into
an intact blastocyst cavity or under the zona pellucida, at the morula stage embryo, ES cells
are capable of contributing to all somatic tissues including the germ line in the resulting
chimeras.

performed at a later stage. Virus or virus-producing cells can be injected into the blastocoele

Embryonic Stem Cell Implantation

Once the expression vector has been introduced into an ES cell, the modified ES cell
is then introduced back into the embryonic environment for expression and subsequent 
transmission to progeny animals. The most commonly used method is the injection of 
several ES cells into the blastocoel cavity of intact blastocysts [Bradley et al., (1984) Nature
309:225-256]. Alternatively, a clump of ES cells may be sandwiched between two eight-cell
embryos [Bradley et al., (1987) in "Teratocarcinomas and Embryonic Stem Cells: A
Practical Approach," Ed. Robertson E.J. (IRL, Oxford, U.K.), pp. 113-151; Nagy et al.,
(1990) Development 110:815-821]. Both methods result in germ line transmission at high 
frequency. 

Target cells which contain the heterologous nucleic acid sequences are recovered, and
the presence of the heterologous nucleic acid sequence in the target cells as well as in the
animal is determined as described supra.

E. Gene Therapy 

The regulatory nucleic acid sequences provided herein may be used for gene therapy
applications in both non-human animals and humans. For example, the regulatory
nucleic acid sequences of the invention may be introduced into cells using an expression
vector which encodes a polypeptide sequence of interest using a variety of means known in
the art to be useful both for delivery in vivo and ex vivo, including (1) recombinant retroviral
transduction, (2) recombinant adenoviral vectors, (3) targeted cationic liposomes, and (4)
gene transfer using biolistics, as described in the following sections.

1. Recombinant Retroviral Transduction 

Retroviral vectors encoding polypeptides of interest may be used for the expression of
the polypeptides in any desired cell, such as primary tumor cells. The transfer of
polypeptides of interest using retroviruses may be made more efficient by increasing the titer
of the virus encoding the polypeptides of interest and increasing the transduction efficiency.
To increase the virus titer, the retroviral construct may be designed to include a selectable
marker (e.g., neo gene), and cells harboring the retroviral construct are selected by growth in
the presence of a suitable selective agent (e.g., G418) followed by expansion of clones
producing the highest titers of virus. To improve the transduction efficiency, retroviruses are
used in combination with liposomes or poly-L-ornithine or polylysine to enhance virus
uptake.

Another way to improve gene transfer efficiency using retroviruses is to increase the
targeting efficiency. Many tumor cells including glioblastomas and melanomas express
excess levels of the transferrin receptor. Transferrin has been used to increase the
transduction efficiency of adenovirus in combination with polylysine. Several recent reports
demonstrated that replacing the SU (surface) domain of the env gene of a retrovirus can
increase receptor-mediated transduction efficiency. The human transferrin gene is 2097 bp
long and its insertion into the SU domain of the env gene of MLV vector may not produce a
stable Env product. However, since earlier studies have suggested that the modified Env
fusion protein requires the native Env for stable assembly and efficient entry, co-transfection
of the transferrin-env fusion gene with the native env gene may be used to produce retrovirus
particles bearing a mixture of wild type and recombinant Env. The gene transfer efficiency
of the new vector may be examined by transducing tumor cells expressing high levels of
transferrin receptor.

2. Recombinant Adenoviral Vectors 

Recombinant adenoviruses can accommodate relatively large segments of foreign
DNA (~7 kb), and have the advantage of a broad host cell range and high titer virus
production. Adenoviruses have been used in vivo in rats to efficiently deliver genes to the
liver and the pancreatic islets [reviewed in Becker et al. (1994) In Protein Expression in
Animal Cells, Roth et al. eds.] and to the central nervous system [Davidson et al. (1993)
Nature Genet. 3:219]. Rat livers have also been efficiently transduced ex vivo and then
re-implanted [Shaked et al. (1994) Transplantation 57:1508]. Thus, the present invention
contemplates ex vivo transfection followed by transplantation of the transfected cells or
organ.

The replication defective recombinant adenoviruses are preferably employed; these
viruses contain a deletion of the key immediate early genes E1a and E1b. To generate and
propagate recombinant viruses, a packaging cell line such as 293 cells which supply the E1a
and E2a proteins in trans is employed. Recombinant adenoviruses are created by making
use of intracellular recombination between a much larger plasmid encoding most of the viral
genome and a small plasmid containing the nucleotide sequence of interest flanked by
regions of homology with the viral integration site. Standard methods may be used to
construct the recombinant adenoviruses [Graham and Prevec (1991) Meth. Mol. Biol. 7:109-
128; Becker et al. (1994) In Protein Expression in Animal Cells, Roth et al. eds.]. Briefly,
each plasmid is co-transfected together with pJM17 (Microbix Systems, Toronto) into
sub-confluent monolayers of 293 cells (ATCC CRL 1573) using calcium phosphate
precipitation and a glycerol shock. Initial recombinant viral stocks are titered on monolayers
of 293 cells, and isolated single plaques are obtained and tested for expression of the
polypeptide of interest using ELISA. Viral stocks are amplified and titered on 293 cells, and
stored in aliquots at -70°C; if necessary, stocks are concentrated by centrifugation on density
gradients. To infect tumor cells with recombinant adenoviruses, freshly isolated tumor cells
are mixed with adenoviral stocks in a minimal volume. Titers of stocks are typically 1 0^- 
lOVml. Medium is replaced after several hours and the cells are followed for expression of 
the recombinant adenoviral-encoded polypeptide of interest (e.g., reporter genes).

A potential drawback of using an adenoviral delivery system is that the transduced
cells may retain or express small quantities of adenoviral antigens on their surface. "Second
generation" adenoviral vectors which contain deletions in the E2a gene are available and are
associated with less inflammation in the recipient and a longer period of expression of the
gene of interest [Engelhardt et al. (1994) Proc. Natl. Acad. Sci. USA 91:6196]. If
necessary, nucleic acid sequences encoding polypeptides of interest are inserted into second
generation adenoviral vectors.

Recently, adenoassociated virus (AAV) vectors and chimeric lentivirus vectors have
also shown promise in the expression of polypeptide sequences of interest.

3. Targeted Cationic Liposomes 

Cationic liposomes have proven to be a safe and effective means for inducing the
transient expression of DNA in target cells [Ledley (1995) Human Gene Ther. 6:1129].
Clinical trials are underway using cationic liposomes to introduce the CFTR gene into the
lungs of cystic fibrosis patients [Caplen et al. (1994) Gene Ther. 1:139 and Alton et al.
(1993) Nature Genet. 5:135] or to introduce, by direct intra-tumor injection, the T cell
costimulator B7-1 into malignant melanoma lesions in order to induce a cell-mediated
immune response [Nabel et al. (1993) Proc. Natl. Acad. Sci. USA 90:11307].

Cationic liposomes (e.g., DOTAP/DOPE) and ligand-targeted cationic liposomes may
be employed for the delivery of polypeptides of interest to tumor cells. Recently, in addition
to cationic liposomes, neutral liposomes have also been reported to be useful in targeting
ligands to cells. Ligand-targeted liposomes are made by covalently attaching ligands or
antibodies to the surface of the cationic liposome. For example, when glioblastoma cells are
to be targeted, transferrin is used as the ligand as glioblastoma cells express high levels of
the transferrin receptor on their surface. When melanoma cells are to be targeted,
internalizing receptors, monoclonal antibodies directed against melanoma-specific surface
antigens (e.g., mAb HMSA5) may be employed as the ligand.

Plasmid DNA encoding polypeptides of interest is formed into a complex with
preformed cationic liposomes using standard methodology or alternatively the DNA is
encapsulated into the liposome interior. The DNA-containing liposomes are then used to
transfer the DNA to tumor cells in vivo by direct intra-tumor injection or in vitro (using
freshly explanted tumor cells) followed by return of the transduced cells to the recipient
(e.g., a human patient or non-human animal).



4. Gene Transfer Using Biolistics 

Biolistics (microballistics) is a method of delivering DNA into cells by projection of
DNA-coated particles into cells or tissues. DNA is coated onto the surface of gold or
tungsten microparticles (~1-3 µm diameter) and these particles are accelerated to high velocity
and are impacted onto the target cells. The particles burst through the cell membrane and
lodge within the target cell. The cell membrane quickly reseals and the passenger DNA
elutes off of the particle and is expressed. The biolistic method has been used to transfect
mammalian cells [Sanford et al. (1993) Methods Enzymol. 217:483].

A hand-held biolistic apparatus (BioRad) is used to transfer DNA into tumor cells or
isolated tumor fragments. This device uses compressed helium to drive a disc-shaped
macroprojectile which carries on its surface microparticles (1-5 µm) of gold which have
been coated with purified plasmid DNA (coprecipitated with spermine) (Williams et al.,
supra). This apparatus has been used to successfully transfect primary tissues.

Plasmid DNA encoding the polypeptides of interest may be coated onto the surface of
gold microparticles according to the manufacturer's instructions (BioRad) and the biolistic
apparatus is used to transfer the DNA into freshly explanted tumor cells or directly into
exposed tumors (e.g., metastatic nodules on the surface of the liver, melanoma lesions on the
skin).

Regardless of the method of delivery of the expression vector into a cell, it is
preferred, though not required, that the expression vector contain a selection marker (e.g.,
neo gene) to facilitate selection of transfected cells. Transfected cells are selected by growth
in the presence of G418 (e.g., 200 µg/ml), followed by culture in growth medium containing
reduced concentrations of G418 (e.g., 100 µg/ml) and growth to confluence. Expression of
the polypeptides of interest is evaluated using, for example, immunoblot analysis or flow
cytometry using monoclonal antibodies which are specific for the polypeptides of interest. It
is preferred, though not necessary, that expression of the polypeptides of interest in the
transfected tumor cells is both constitutive and stable. Constitutive expression refers to
expression in the absence of a triggering event or condition, and can be achieved by the
selection of a promoter which drives expression of the nucleic acid sequence encoding the
polypeptides of interest. Examples of promoters which drive constitutive expression of a
structural nucleic acid sequence which is operably linked to the promoter include the SRα
promoter, CMV promoter, and HIV promoter.

Regardless of the type of expression vector used for delivery of the nucleic acid
sequences of interest into a cell, the expression vector may be introduced to the cell by direct
injection into tumor and/or preneoplastic tissue, systemic (e.g., intravenous) administration,
aerosol administration (e.g., for delivery to the bronchial tree and other lung tissues),
injection into breast ducts (e.g., for delivery to breast tissue), and topical administration (e.g.,
for delivery to cervical tissue).

F. Reducing Expression Of Factor IX In An Animal 

The regulatory sequences of the invention may also be used to reduce expression of a
polypeptide sequence of interest which is encoded by a nucleic acid sequence whose
transcription is under the regulatory control of the regulatory sequences provided herein. For
example, the regulatory sequences of the invention may be used to reduce the rate of
age-related increase of FIX activity in an animal as a means of treating diseases (e.g.,
thrombosis, cardiovascular disease, etc.) which are associated with age-related increases in
FIX activity. Since the inventors have discovered that the exemplary nucleic acid sequences
AE5' and AE3' regulate stable and increased expression levels, respectively, of hFIX, the
increase in the level of hFIX activity over time may be reduced by inhibiting the function of
AE3' which regulates increased expression of hFIX. This approach has the advantage that
expression of hFIX remains under the control of AE5' thus providing hFIX activities which
are stable over time and which continue to play an important role in normal blood
coagulation processes.

The function of AE3' in age-related expression of FIX may be inhibited by, for
example, inhibiting the activity of the protein which specifically binds to AE3'. The
protein(s) which bind to AE3' may be identified by using the AE3' (or the minimum portion
of AE3' which has age-related regulatory activity) to screen protein libraries for specific
binding to AE3' or its portion. Once the protein which binds to AE3' is identified, the
function of this protein may be inhibited using antibodies which are specific for this protein.
Antibodies which are specific for the protein which binds to AE3' are expected to disrupt the
interaction between AE3' and this protein.

Antibodies (polyclonal and monoclonal) which are specific for the protein that binds
to AE3' or portions thereof may be generated using methods known in the art. The term
"antibody" refers to immunoglobulin evoked in animals by an immunogen (antigen). It is
desired that the antibody demonstrates specificity to epitopes contained in the immunogen.
The term "polyclonal antibody" refers to immunoglobulin produced from more than a single
clone of plasma cells; in contrast "monoclonal antibody" refers to immunoglobulin produced
from a single clone of plasma cells. The terms "specific binding," "specifically binding" and
grammatical equivalents thereof when used in reference to the interaction of an antibody and
an immunogen mean that the interaction is dependent upon the presence of a particular
structure (i.e., the antigenic determinant or epitope) on the immunogen; in other words the
antibody is recognizing and binding to a specific immunogen structure rather than to
immunogens in general. For example, if an antibody is specific for epitope "A", the
presence of an immunogen containing epitope A (or free, unlabelled A) in a reaction
containing labelled "A" and the antibody will reduce the amount of labelled A bound to the
antibody.

Polyclonal and monoclonal antibodies which are specific to a desirable polypeptide,
given the teachings herein, may readily be prepared by one of skill in the art. For example,
monoclonal antibodies may be generated by immunizing an animal (e.g., mouse, rabbit, etc.)
with a desired antigen and the spleen cells from the immunized animal are immortalized,
commonly by fusion with a myeloma cell. Immunization with antigen may be accomplished
in the presence or absence of an adjuvant, e.g., Freund's adjuvant. Typically, for a mouse,
10 µg antigen in 50-200 µl adjuvant or aqueous solution is administered per mouse by
subcutaneous, intraperitoneal or intra-muscular routes. Booster immunization may be given
at intervals, e.g., 2-8 weeks. The final boost is given approximately 2-4 days prior to fusion
and is generally given in aqueous form rather than in adjuvant.

Spleen cells from the immunized animals may be prepared by teasing the spleen
through a sterile sieve into culture medium at room temperature, or by gently releasing the
spleen cells into medium by pressure between the frosted ends of two sterile glass
microscope slides. The cells are harvested by centrifugation (400 x g for 5 min.), washed
and counted. Spleen cells are fused with myeloma cells to generate hybridoma cell lines.
Several mouse myeloma cell lines which have been selected for sensitivity to
hypoxanthine-aminopterin-thymidine (HAT) are commercially available and may be grown in,
for example, Dulbecco's modified Eagle's medium (DMEM) (Gibco BRL) containing 10-15% fetal
calf serum. Fusion of myeloma cells and spleen cells may be accomplished using polyethylene
glycol (PEG) or by electrofusion using protocols which are routine in the art. Fused cells
are distributed into 96-well plates followed by selection of fused cells by culture for 1-2
weeks in 0.1 ml DMEM containing 10-15% fetal calf serum and HAT. The supernatants are
screened for antibody production using methods well known in the art. Hybridoma clones
from wells containing cells which produce antibody are obtained, e.g., by limiting dilution.
Cloned hybridoma cells (4-5 x 10^) are implanted intraperitoneally in recipient mice, 
preferably of a BALB/c genetic background. Sera and ascites fluids are collected from mice
after 10-14 days. 

The invention also contemplates humanized antibodies which may be generated using
methods known in the art, such as those described in U.S. Patent Numbers 5,545,806;
5,569,825 and 5,625,126, the entire contents of which are incorporated by reference. Such
methods include, for example, generation of transgenic non-human animals which contain
human immunoglobulin chain genes and which are capable of expressing these genes to
produce a repertoire of antibodies of various isotypes encoded by the human immunoglobulin
genes.

Alternatively, the function of AE3' in age-related expression of FIX may be inhibited
by, for example, inhibiting the activity of AE3' using antisense sequences which are directed
to AE3'. The term "antisense" as used herein refers to a deoxyribonucleotide sequence
whose sequence of deoxyribonucleotide residues is in reverse 5' to 3' orientation in relation
to the sequence of deoxyribonucleotide residues in a strand of a DNA duplex. AE3'
antisense sequences may be used to turn off genes under the expression regulation of AE3'
by transfecting a cell or tissue with expression vectors which express high levels of a desired
AE3' antisense oligomer (e.g., 15-20 nucleotides) or larger fragment. Such constructs can
flood cells with antisense sequences which inhibit expression of FIX. Antisense sequences
can be designed from various locations along the AE3' sequence. Animals (e.g., mice)
treated with vectors expressing AE3' antisense sequences are monitored for changes in the
age-related symptoms associated with FIX expression. The alleviation or treatment of one or
more of these symptoms in an animal by an antisense sequence suggests that the antisense
sequence may be useful in the treatment and/or prevention of age-related FIX expression in
humans.
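The design of antisense oligomers from various locations along a target sequence can be sketched in Python as follows. This is an illustration only: it reads "antisense" as the reverse complement of a window on one strand (an assumption about the intended meaning), the target sequence is a made-up placeholder rather than the AE3' sequence, and the function names, the 18-nucleotide window and the 6-nucleotide step are hypothetical choices within the 15-20 nucleotide range mentioned above.

```python
# Sketch only: TARGET is a made-up placeholder, not the AE3' sequence.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence, written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def antisense_oligomers(target: str, length: int = 18, step: int = 6):
    """Yield (start, oligomer) pairs for windows tiled along the target.

    `start` is the 0-based position of the window on the target strand;
    each oligomer is the reverse complement of that window.
    """
    if not 15 <= length <= 20:
        raise ValueError("oligomer length is expected to be about 15-20 nucleotides")
    for start in range(0, len(target) - length + 1, step):
        yield start, reverse_complement(target[start:start + length])


TARGET = "ATGCGTACCTGAAGTCCATGGCTAAGCTTGCA"  # hypothetical 32-nt target

if __name__ == "__main__":
    for start, oligo in antisense_oligomers(TARGET):
        print(start, oligo)
```

Each candidate would still have to be tested experimentally, as the text describes; the sketch only enumerates sequences.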

EXPERIMENTAL 

The following examples serve to illustrate certain preferred embodiments and aspects
of the present invention and are not to be construed as limiting the scope thereof. Unless
otherwise mentioned, all references to nucleotide numbers with respect to the factor IX
nucleotide sequence refer to the nucleotide numbers of the hFIX gene sequence shown in
Figure 8.

EXAMPLE 1 

Construction Of A Series of Twelve Exemplary Human Factor IX (hFIX) Minigene
Expression Vectors

To explore the molecular mechanisms underlying age-related regulation of Factor IX,
a series of twelve hFIX minigene expression vectors was constructed. These vectors were
first analyzed in vitro in HepG2 cells, a human hepatoma cell line (see Example 2, infra).
Transgenic mice harboring the hFIX minigene vectors were generated and longitudinal
analyses of hFIX expression for the entire life spans of founders and successive generations
of transgenic mice were carried out (see Example 3, infra).

The twelve exemplary minigenes contained sequences derived entirely from the hFIX
gene sequence, including (a) promoter sequences of various lengths spanning up to
nucleotide (nt) -2231 in the 5' flanking region, (b) the coding region containing a first intron
in which the first intron's middle portion is truncated, i.e., nt +1098 through nt +5882 of
Figure 8, and (c) either the complete 3' UTR sequence or the 3' UTR sequence in which the
middle portion was deleted. Figure 1 shows the structure of eleven out of the twelve human
FIX minigene expression constructs. The name of each construct is shown at left. The
structure is depicted with the promoter-containing regions (solid thick line on left) with the
5' terminal nucleotide number. Transcribed hFIX regions (open rectangles connected with
thin lines representing the shortened first intron) are followed by 3' flanking sequence
regions (solid thick line at right). Arrow: transcription start site; asterisk: translation stop
codon; pA: polyadenylation; sl: potential stem-loop forming dinucleotide repeats.



Construction of hFIX minigene expression vectors was carried out using -416FIXml
as the starting construct (Kurachi et al. (1995) J. Biol. Chem. 270:5276-5281). The
nucleotide (nt) numbering system used in this study was based on the complete hFIX gene
sequence previously reported (Yoshitake et al. (1985) Biochem. 24:3736-3750). Minigene
-416FIXml/1.4 was constructed from -416FIXml by inserting the middle portion of the 3'
UTR (1.2 kb in size) which was generated by PCR using the following primer set with
BamH I linkers: 5' primer, TAACAGGATCCGGCCTCTCACTAACTAATCAC (nt +31418
through +31438) (SEQ ID NO:14) and 3' primer,
CAACTGGATCCAAGATTCAAGATAGAAGGAT (nt +32690 through +32671) (SEQ ID
NO:15), and human genomic DNA as an amplification template. The PCR product was
digested with BamH I, and the generated fragment was inserted into the 3' UTR BamH I site
of -416FIXml, thus producing -416FIXml/1.4 which contained the entire 3' UTR.
-416FIXml/0.7 was constructed by inserting the PCR-amplified 700 bp fragment with BamH I
linker, containing the 3' contiguous sequence to nt +32117. The primers used were, 5'
primer: same as that for -416FIXml/1.4, 3' primer:
GGACAGGATCCCCCAAACTTTTCAGGCAC (nt +32117 through +32097) (SEQ ID NO:16). Minigenes
-590FIXml, -679FIXml, -770FIXml, -802FIXml and -2231FIXml were produced by
replacing the 5' end 433 bp sequence of -416FIXml released by Sph I/Nhe I digestion with
607, 696, 787, 819 and 2248 bp fragments containing the 5' end hFIX region extended up to
nt -590, -679, -770, -802 and -2231, respectively. These latter fragments were generated by
Sph I/Nhe I digestion of the PCR product obtained with 5' primers:

CAAGCATGCATCTAGTGTTAGTGGAAGAG (nt -590 through -571) (SEQ ID NO:17),
CAAGCATGCAAATATTAACTCAAAATGGA (nt -679 through -660) (SEQ ID NO:18),
CAAGCATGCTGTTGTTTTTTGTTTTAAC (nt -770 through -752) (SEQ ID NO:19),
CAAGCATGCAGCCATTCAGTCGAGGAAGG (nt -802 through -783) (SEQ ID NO:20),
CAAGCATGCGATCCCTTCCTTATACCT (nt -2231 through -2214) (SEQ ID NO:21) with
Sph I linker and the common 3' primer TAAGCTTAACCTTTGCTAGCAGATTGT (nt +30
through +10) (SEQ ID NO:22) and human genomic DNA as the amplification template.
Minigene -802FIXml/0.7 (whose structure is not shown in Figure 1) contains the 3' UTR
region through nt 32,140, which is then connected to nt 32,690 through its downstream poly
(A) signal sequence that is common to each of the other eleven constructs.
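The fragment sizes quoted for the promoter-extension constructs can be cross-checked with a few lines of Python. The sketch below only uses numbers stated in the text (the 5' terminal nucleotide of each construct and the corresponding Sph I/Nhe I fragment size); the names `FRAGMENTS` and `size_offsets` are hypothetical, and what the shared offset physically corresponds to is not stated in the text.

```python
# 5' terminal nucleotide of each construct -> Sph I/Nhe I fragment size (bp), as quoted in the text.
FRAGMENTS = {
    -416: 433,
    -590: 607,
    -679: 696,
    -770: 787,
    -802: 819,
    -2231: 2248,
}


def size_offsets(fragments):
    """Return fragment size minus upstream distance for every construct."""
    return {nt: size - abs(nt) for nt, size in fragments.items()}


offsets = size_offsets(FRAGMENTS)
# A single shared offset means the fragments differ only by the length of the promoter extension.
consistent = len(set(offsets.values())) == 1

if __name__ == "__main__":
    print(offsets, consistent)
```

A shared offset across all six constructs is what one would expect if every fragment ends at the same downstream Nhe I site.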

-416FIXml/AE5' depicts a construct with the AE5' region moved to the 3'-end
position and shown as an open box at right. -416FIXml/AE5' was constructed by inserting
the Kpn I fragment generated by PCR (nt -802 through nt -417) into the -416FIXml vector
(the Kpn I site is outside of the FIX gene, Figure 1). The 5' and 3' primers used for PCR
were CTTGGTACCAGCCATTCAGTCGAGGAAGG (nt -802 through -783) (SEQ ID
NO:23) and CTTGGTACCATATGAATCCTTTCATAGAT (nt -417 through -436) (SEQ ID
NO:24), respectively. All constructs were sequenced through PCR amplified regions as well
as fragment ligation sites to confirm the correct sequences and orientations.
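As a simple consistency check on the primers listed in this Example, the following Python sketch confirms that each primer carries the recognition sequence of the enzyme used to cut its PCR product. The recognition sequences are the standard ones for these enzymes; the pairing of SEQ ID NO:22 with Nhe I is an inference from the Sph I/Nhe I digestion described above, and the names `SITES`, `PRIMERS` and `has_site` are hypothetical.

```python
# Standard recognition sequences of the restriction enzymes named in the text.
SITES = {
    "BamH I": "GGATCC",
    "Sph I": "GCATGC",
    "Nhe I": "GCTAGC",
    "Kpn I": "GGTACC",
}

# (SEQ ID NO, enzyme whose site is expected, primer sequence as printed in the text)
PRIMERS = [
    (14, "BamH I", "TAACAGGATCCGGCCTCTCACTAACTAATCAC"),
    (15, "BamH I", "CAACTGGATCCAAGATTCAAGATAGAAGGAT"),
    (16, "BamH I", "GGACAGGATCCCCCAAACTTTTCAGGCAC"),
    (17, "Sph I", "CAAGCATGCATCTAGTGTTAGTGGAAGAG"),
    (18, "Sph I", "CAAGCATGCAAATATTAACTCAAAATGGA"),
    (19, "Sph I", "CAAGCATGCTGTTGTTTTTTGTTTTAAC"),
    (20, "Sph I", "CAAGCATGCAGCCATTCAGTCGAGGAAGG"),
    (21, "Sph I", "CAAGCATGCGATCCCTTCCTTATACCT"),
    (22, "Nhe I", "TAAGCTTAACCTTTGCTAGCAGATTGT"),  # inferred from the Sph I/Nhe I digestion
    (23, "Kpn I", "CTTGGTACCAGCCATTCAGTCGAGGAAGG"),
    (24, "Kpn I", "CTTGGTACCATATGAATCCTTTCATAGAT"),
]


def has_site(primer: str, enzyme: str) -> bool:
    """True if the primer contains the enzyme's recognition sequence."""
    return SITES[enzyme] in primer.upper()


results = {seq_id: has_site(seq, enzyme) for seq_id, enzyme, seq in PRIMERS}

if __name__ == "__main__":
    for seq_id, ok in results.items():
        print(f"SEQ ID NO:{seq_id}: {'site present' if ok else 'site missing'}")
```

A check of this kind catches transcription errors in primer listings before any cloning is attempted; it does not replace the sequencing step described above.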


EXAMPLE 2 

Transient Expression of Eleven hFIX Minigene Expression Vectors In Vitro In Human
Hepatoma HepG2 Cell Line

Transient in vitro expression activities of hFIX minigene constructs were assayed
using HepG2 cells and hFIX specific enzyme linked immunosorbent assay (ELISA) as
previously described (Kurachi et al. (1995) J. Biol. Chem. 270:5276-5281) with some
modifications. Cell transfection was carried out by the calcium phosphate-DNA conjugate
method or later, using FuGene 6 (Boehringer Mannheim). The latter, improved transfection
method consistently increased transfection efficiency to >20% (Kurachi et al. (1998)
Biochemica 3:43-44), and all earlier assays were reexamined using FuGene 6. Four to five
independent assays of factor IX activity were carried out and the averages were shown with
standard errors. With FuGene 6 transfection, the control minigene -416FIXml typically
produced hFIX at a level of ~50 ng/10^ cells/48 hr.

Figure 1 shows the relative in vitro transient expression activities of the human FIX
minigene expression constructs (transient expression activity of minigene -802FIXml/0.7
which is not shown in Figure 1 was 81.5% of the activity of minigene -416FIXml).
Transient expression activities relative to the activity of -416FIXml (~50 ng/10^ cells/48 hour,
and defined as 100% activity) are shown on the right side with standard deviations (from 4-5
independent assays). Activities were normalized to the size of minigenes used.
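The way relative activities of this kind are typically derived can be sketched in Python. The numbers below are invented for illustration and are not data from this study; the function names are hypothetical, and the additional normalization to minigene size mentioned above is not modeled.

```python
from math import sqrt
from statistics import mean, stdev


def relative_activity(test_ng, control_ng):
    """Express each assay's hFIX yield as a percentage of the matched control yield."""
    return [100.0 * t / c for t, c in zip(test_ng, control_ng)]


def mean_and_sem(values):
    """Return (mean, standard error of the mean) for a set of independent assays."""
    return mean(values), stdev(values) / sqrt(len(values))


# Hypothetical yields from four independent transfections (same units in both lists).
control = [50.0, 48.0, 52.0, 50.0]  # control minigene, defined as 100% activity
test = [40.0, 39.0, 43.0, 41.0]     # a construct under comparison

percentages = relative_activity(test, control)
avg, sem = mean_and_sem(percentages)

if __name__ == "__main__":
    print(f"relative activity: {avg:.1f}% (SEM {sem:.1f}, n={len(percentages)})")
```

Pairing each test value with the control from the same experiment, as assumed here, removes day-to-day variation in transfection efficiency from the comparison.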

The relative transient expression activities shown in Figure 1 indicate that all constructs
showed comparable high transient hFIX expression in HepG2 cells (~50 ng/10^ cells/48
hours). However, all the constructs containing the complete 3' UTR, including a 102 base
pair (bp) stretch of inverted AT, GT and GC dinucleotide repeats [Yoshitake et al. (1985)
Biochem. 24:3736-3750], reproducibly showed expression activity levels which were 25-30%
lower than corresponding minigenes without the repeat sequences. Dinucleotide repeats
similar to those seen in the hFIX 3' UTR, which can form stable stem-loop (sl) structures in
mRNA, have been implicated in controlling mRNA stability in mammals as well as yeast
and plants, thus providing an important layer of protein biosynthesis regulation [Ross (1995)
Microbiol. Rev. 59:423-450]. Together, these results suggest a similar negative regulatory
activity for this structure of the hFIX gene in the HepG2 assay system on expression of the
hFIX gene. As described below (e.g., Example 3), however, the 3' UTR structure of the
hFIX gene containing the dinucleotide repeat region showed unexpected functions in vivo
which are critical for advancing age-related regulation of the hFIX gene.

Another important and surprising finding with the HepG2 cell assay system is that
expression by these hFIX minigenes (which contained sequences which are positioned
upstream and downstream of the hFIX gene, and which are derived from the homologous
hFIX gene instead of from heterologous reporter genes) does not show any down-regulation
in the presence of the 5' upstream region (nt -802 up through nt -1900) [Salier et al. (1990)
J. Biol. Chem. 265:7062-7068] (Figure 1). In contrast, when a CAT reporter gene was used,
negative regulatory elements were identified in this region [Salier et al. (1990) J. Biol.
Chem. 265:7062-7068].

EXAMPLE 3

Generation And Analysis Of Transgenic Mice Harboring hFIX Minigene Expression
Vectors

Transgenic animals were constructed using the expression plasmids described above
in Example 2 according to standard methods [Hogan et al. (1994) in "Manipulating the
Mouse Embryo, a Laboratory Manual" (Cold Spring Harbor Press, New York, 2nd Edition)].
All animal experiments were carried out in accordance with the institutional guidelines of the
University of Michigan (OPRR No. A3114-01).

Briefly, Factor IX minigene expression plasmids were double-digested with Sph 
I/Kpn I and the factor IX minigene-containing fragments released were isolated by 0.8% 
agarose gel electrophoresis, followed by purification with SpinBind DNA extraction units 
(FMC). Fertilized eggs of C57B/6 X SJL mice were microinjected with the DNA (1-2 
ng/egg), and implanted into foster mother animals (CD-1).

A. Multiplex PCR Analysis

Offspring produced were screened for founder animals with high transgene copy
numbers (5-10 copies per genome) using quantitative multiplex PCR analyses of tail tissue
DNA samples. Two pairs of primers were used, one specific to the hFIX transgenes and the
other specific to mouse β-globin gene (endogenous control): 5' primer:
GTGTGGGAAGAGACAGATTTTGG (nt +6172 through +6195) (SEQ ID NO:25) and 3'
primer: GGAATAATTCGAATCACAT (nt +30885 through +30867) (SEQ ID NO:26), and
5' primer: CCAATCTGCTCACACAGGAT (nt +2590 through +2609) (SEQ ID NO:27) and
3' primer: CCTTGAGGCTGTCCAAGTGA (nt +3083 through +3064) (SEQ ID NO:28),
respectively. These primers were designed to amplify a unique 966 bp fragment from the
hFIX transgenes and a 494 bp fragment from the mouse β-globin gene, respectively. PCR
was initiated with 3 min incubation at 94°C, followed by 25 cycles of 94°C for 30 sec, 65°C
annealing for 1 min and 72°C extension for 2 min.
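The β-globin product size and the length of the thermal program follow directly from the figures given above, as the Python sketch below shows. The function and constant names are hypothetical, and ramp times between steps are not stated in the text and are ignored. The 966 bp hFIX product is not computed here because its primers lie far apart on the genomic numbering, presumably on either side of intronic sequence that is absent from the minigenes.

```python
def amplicon_length(forward_5prime: int, reverse_5prime: int) -> int:
    """Length in bp of a PCR product spanning two primer 5' ends on a contiguous template."""
    return abs(reverse_5prime - forward_5prime) + 1


# Mouse beta-globin primers: 5' primer starts at nt +2590, 3' primer starts at nt +3083.
beta_globin_bp = amplicon_length(2590, 3083)

# Thermal program from the text, in minutes.
INITIAL_DENATURATION = 3.0
CYCLES = 25
STEP_MINUTES = {"denature_94C": 0.5, "anneal_65C": 1.0, "extend_72C": 2.0}


def program_minutes() -> float:
    """Total hold time of the program, excluding instrument ramp times."""
    return INITIAL_DENATURATION + CYCLES * sum(STEP_MINUTES.values())


if __name__ == "__main__":
    print(beta_globin_bp, "bp;", program_minutes(), "min")
```

The computed β-globin product length agrees with the 494 bp stated in the text.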

Founders were back-crossed with non-transgenic mice (C57B/6 X SJL) to generate F1
progeny animals. Homozygous F2 animals were generated by crossing among heterozygous
F1 littermates and the following generations were similarly generated. Zygosity status of
animals was determined by quantitative multiplex PCR analysis as described above.
Minimally, three founder lines for each minigene construct were subjected to longitudinal
analysis for their entire life spans up to two years.

Figure 3B shows the results of quantitative multiplex PCR analysis to determine the
relative transgene levels in tail and liver tissues. Genomic DNA was extracted from snipped
tail tissue of a transgenic -416FIXml animal (PA112) at 3 weeks and at 19 months of age.
Liver DNA was extracted from the same animal (PA112) sacrificed at 19 months of age and
a -416FIXml animal (PA412) sacrificed at 1 month of age. Positions of the hFIX-specific
fragment (966 bp) and the mouse β-globin-specific fragment (494 bp, internal copy number
control) are shown on the right. Lane 1: kb size ladder; lane 2: fragment size control
amplified from -416FIXml plasmid; lane 3: non-transgenic mouse tail DNA as template;
lane 4: tail DNA of PA112 at 3 weeks of age; lane 5: tail DNA of PA112 at 19 months of
age; lane 6: liver DNA of PA412 at 1 month of age; lane 7: liver DNA of PA112 at 19
months of age. The relative transgene copy numbers for the 1 month-old versus the 19
month-old animals, normalized to the endogenous mouse β-globin gene, were 1.0-1.1 for
both tail and liver genomic DNA preparations, showing no sign of loss of the hFIX
transgene in the genome with age (Multi Analyst program from BioRad used for quantitation
and calculation of ratios).
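
The normalization described above can be illustrated with a short calculation. This is only a sketch under stated assumptions: the band intensities are hypothetical placeholder values (no raw intensities are reported here), the function names are introduced for illustration, and the arithmetic is not claimed to reproduce the Multi Analyst program.

```python
def normalized_signal(transgene_band: float, control_band: float) -> float:
    """hFIX band intensity divided by the beta-globin internal control."""
    if control_band <= 0:
        raise ValueError("control band intensity must be positive")
    return transgene_band / control_band


def relative_copy_number(young: tuple, old: tuple) -> float:
    """Ratio of normalized transgene signal, young animal over old animal.

    Each argument is (hFIX band intensity, beta-globin band intensity).
    """
    return normalized_signal(*young) / normalized_signal(*old)


# Hypothetical intensities in arbitrary units (NOT data from this document).
young_sample = (1200.0, 1000.0)
old_sample = (1100.0, 1000.0)
ratio = relative_copy_number(young_sample, old_sample)
print(round(ratio, 2))
```

A ratio close to 1 under this scheme would indicate that the transgene signal, relative to the endogenous control, is unchanged between the two ages.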

B. Immunoassay of hFIX Levels In Transgenic Mice 
Circulatory hFIX levels were monitored during longitudinal analyses of transgenic
mice from the representative founder lines carrying various hFIX minigene transgenes. At
various ages, starting at one month of age, transgenic mice were individually subjected to
blood sample collection (aliquot of ~100 µl) via tail-tip snipping, and the obtained serum was
routinely used to quantify hFIX levels in the circulation using duplicated hFIX-specific
ELISA for each age point. Pooled human plasma (George King Bio-Medical) was used to
prepare an hFIX standard curve for each assay. In order to minimize experimental
fluctuations from assay to assay in the longitudinal analysis, overlapping serum samples from
the previous assay group were included in each assay. To ensure reproducibility, three to six
independent founder lines were generated for each minigene construct, and animals from at
least three representative lines were subjected to longitudinal analyses for their entire life
spans. The duplicated ELISA values varied less than 11% from the averages. The results
are shown in Figures 2 and 4.
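
The statement that duplicated ELISA values varied less than 11% from the averages can be made concrete with a small check. The sketch below assumes that "variation" means the absolute deviation of each replicate from the replicate mean, expressed as a percentage of that mean; the function name and the example readings are hypothetical and are not data from this document.

```python
def max_percent_deviation(replicates: list) -> float:
    """Largest absolute deviation from the replicate mean, as % of the mean."""
    if not replicates:
        raise ValueError("at least one replicate is required")
    mean = sum(replicates) / len(replicates)
    if mean <= 0:
        raise ValueError("mean must be positive")
    return max(100.0 * abs(value - mean) / mean for value in replicates)


# Hypothetical duplicate readings in ng/ml (illustrative values only).
duplicates = [950.0, 1050.0]
deviation = max_percent_deviation(duplicates)
print(deviation < 11.0)
```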

In all panels in Figures 2 and 4, labeling of animals is based on the tag numbers plus
additional information. The first letter of the label, F or P, represents founder or progeny,
respectively. Information on progeny generation (F1 or F2) and sex is in parentheses (m:
male; f: female), followed by status (+: alive in good health; d: died; s: sacrificed for various
examinations; mo: age at death or sacrifice). To avoid overcrowding of the panels, the
results from representative animals are shown for each minigene construct. Importantly,
age-regulation patterns were remarkably similar among all animals for each specific construct
and different founder lines. Panels A-E of Figure 2 show representative founder line animals
with -416FIXml (A), -416FIXml/1.4 (B), -590FIXml (C), -679FIXml (D) and -770FIXml
(E). Panels A-D of Figure 4 show representative founder line animals with -802FIXml,
-802FIXml/1.4, -2231FIXml and -2231FIXml/1.4, respectively. Panel E of Figure 4 shows
representative founder line animals with -416FIXml/AE5'.

Figure 2 shows that at one month of age, the mice carrying the -416FIXml minigene
produced hFIX at varying levels, from as high as that of natural hFIX gene expression (~4
µg/ml) to much lower levels (~50 ng/ml) (Figure 2A). Such variations are primarily due to
the transgene positional effects in the genome. Circulatory hFIX levels of animals from the
representative founder lines carrying the minigene, however, declined drastically through
puberty and during the subsequent two to three month period to much lower levels, which
then remained stable for the remaining life span. This rapid, characteristic age-dependent
decline in the circulatory hFIX level was observed in all animals analyzed (n=69), regardless
of founder line, differences in initial hFIX level at pre-pubertal age (one month) due to
transgene positional effects, generation (founders and F1 or F2 progeny), sex, or zygosity
status (homozygous / heterozygous) of the transgenes.

C. Northern Blot Analysis of hFIX mRNA in Transgenic Mice

Northern blot analyses of the liver RNA samples from animals (15 µg per lane) were
carried out as previously described [Kurachi et al. (1995) supra] using the 32P-labeled Ssp
I/BamH I fragment (the 3' half of the hFIX coding region of the cDNA) as a probe, and
employing stringent washing conditions. Under these conditions, the probe preferentially
hybridized strongly with hFIX minigene mRNA bands (~1.7 kb) with little cross-hybridization
with the mouse FIX mRNA bands (3.2 kb and 2.2 kb) [Yao et al. (1994) Gene Therapy
1:99-107]. To confirm the presence of equivalent amounts of RNA in each lane, the filters
previously hybridized with hFIX probe were stripped of probe and re-probed with the
RNR18 cDNA (ribosomal RNA 18S). After completion of longitudinal analyses of animals
from key founder lines for their entire life spans, the representative lines were subjected to
embryo-freezing for preservation.

The results of Northern analysis of human FIX mRNA and transgene DNA levels in
the livers of animals carrying -416FIXml are shown in Figure 3A. hFIX mRNA levels in
the liver of young (PA412: F1/f, 1 month of age) and old (PA112: F1/f, 19 months of age)
transgenic animals were analyzed by Northern blot analysis of total liver RNA. PA412 and
PA112 animals were from the same litter produced by the founder FA661, and expressed
1252 and 1675 ng/ml circulatory hFIX at one month of age, respectively. PA112 was
expressing 63.8 ng/ml serum hFIX at the time of sacrifice. Lane 1: non-transgenic mouse
liver RNA; lane 2: transgenic PA412 liver RNA; lane 3: transgenic PA112 liver RNA. FIX
and 18S on the left or right sides indicate the band position of hFIXml mRNA (~1.7 kb) and
RNR18 (1.9 kb, ribosomal RNA), respectively.

Figure 3A shows that the decline in blood hFIX level observed in Figure 2 was
correlated with a similar decline in the steady-state liver hFIX mRNA, which was not due to
a loss of the hFIX transgene with age (Figure 3B). This was further supported by the fact
that when 4-5 month old mice with much decreased hFIX levels had progeny, their pups
showed high pre-pubertal hFIX expression levels equivalent to those of their parents at the
same time point (one month of age).

Minigene vector -416FIXml/1.4 is identical to -416FIXml except that 
-416FIXml/1.4 contains the complete 3'UTR, including the dinucleotide repeat structure 
(102 bp in length) in its middle region [Yoshitake et al (1985) Biochem. 24:3736-3750] 
(Figure 1). Transgenic mice with -416FIXml/1.4 (n=48) (Figure 2B) showed pre-pubertal
high and subsequent age-dependent decline in hFIX levels similar to those of -416FIXml
(Figure 2A), although the decline was less steep and expression levels were stabilized at
significantly higher levels than those observed for -416FIXml (Figure 2B).

These results indicate that, while the 102-bp sequence containing the dinucleotide
repeat structure of hFIX 3' UTR reduces the age-related decline in expression of hFIX, the
presence of the complete 3' UTR containing the extensive dinucleotide repeat structure
nonetheless does not completely rescue hFIX expression from the age-decline observed in all
of these animals, regardless of founder line, initial pre-pubertal hFIX level, generation, sex,
or zygosity status of the transgenes.

All animals carrying minigenes -590FIXml and -679FIXml (a total of 25 and 26
animals subjected to longitudinal analysis, respectively) also showed an age-associated rapid
decline in hFIX expression similar to that seen in animals carrying -416FIXml (Figure 2, C
and D). Furthermore, hFIX expression levels in three independent founder animals
generated to date carrying -770FIXml also rapidly decreased over the puberty period in a
pattern similar to that of the above minigenes (Figure 2E). These observations indicated that
minigenes with the promoter region up to nt -770 contain the basic structural elements
necessary for hFIX expression, but lack a structural element(s) which functions in
age-associated stability of hFIX gene expression.

In contrast, striking and unexpected differences in hFIX expression patterns were
observed with animals carrying the minigene -802FIXml (Figure 4A) as compared to those
carrying the minigene -416FIXml (Figure 2A). -802FIXml is composed of a vector frame
identical to -416FIXml, except that the 5' end flanking sequence included was extended to
nt -802 (Figure 1).

All animals with -802FIXml, -2231FIXml and -416FIXml/AE5' (Figure 4, panels A, C, E)
exhibited stable expression throughout their life spans. Animals with -802FIXml/1.4 and
-2231FIXml/1.4 (Figure 4B, D) exhibited age-associated increases in hFIX expression
levels. All animals maintained or increased stable circulatory hFIX levels regardless of
founder line, initial expression levels at one month of age, sex, generation or zygosity status.
Mice which died at much younger ages than their normal life expectancies are marked with
d. The above results show that all animals from three independent founder lines obtained
with -802FIXml (Figure 4A) showed characteristic differences in hFIX expression pattern
from animals with -416FIXml (Figure 2A) and -416FIXml/1.4 (Figure 2B).

The -802FIXml transgenic animals (n = 62) subjected to longitudinal analysis
invariably showed age-stable plasma hFIX levels for their entire life spans, mostly up to
20-24 months of age. Age-stable circulatory hFIX levels were correlated with age-stable
mRNA levels (Figure 5). These observations with -802FIXml were further supported by
age-stable hFIX expression by mice carrying -2231FIXml (Figure 4C). Together, these
results suggest that the structural element which is responsible for age-stable expression of
the hFIX gene resides in the small region spanning nt -770 through -802. We designated
this small region "age-regulatory element in the 5' end" (AE5'). This region contains a
transcription factor PEA-3 nucleotide sequence (GAGGAAG: nt -784 through -790), which
completely matches the consensus motif (C/G)AGGA(A/T)G [Martin et al. (1988) Proc.
Natl. Acad. Sci. 85:5839-5843; Xin et al. (1992) Genes & Develop. 6:481-496;
Chotteau-Lelievre et al. (1997) Oncogene 15:937-952; Gutman and Wasylyk (1990) EMBO J.
9:2241-2246]. The function of the AE5' nucleotide sequence is position-independent, as shown
by age-stable hFIX expression by animals containing -416FIXml/AE5', in which AE5' was moved
to the 3' end outside of the hFIX minigene (Figure 4E).
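
The match between the PEA-3 site and the consensus motif quoted above can be checked mechanically. The following is a minimal sketch in which the consensus (C/G)AGGA(A/T)G is written as a regular expression; the constant and function names are assumptions introduced for illustration.

```python
import re

# Consensus quoted in the text: (C/G)AGGA(A/T)G
PEA3_CONSENSUS = re.compile(r"[CG]AGGA[AT]G")


def matches_pea3_consensus(site: str) -> bool:
    """True if the whole 7-nt site fits the consensus motif."""
    return PEA3_CONSENSUS.fullmatch(site.upper()) is not None


# Site reported for the AE5' region.
print(matches_pea3_consensus("GAGGAAG"))  # True
```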

Since transgenes of -416FIXml, -590FIXml, -679FIXml and -770FIXml differ from
the minigenes -802FIXml and -2231FIXml only by their promoters, the hFIX mRNA
produced from all of these minigenes (an intron spliced form of FIXml RNA) was expected
to produce identical hFIX protein. Thus, it was hypothesized that the age-dependent decline
in the circulatory hFIX level observed in animals with -416FIXml, -590FIXml, -679FIXml
and -770FIXml, but not with -802FIXml and -2231FIXml, must be due to an
age-dependent decline in the transcriptional activity of the transgenes. This agrees with the fact
that no significant changes with age in hFIX mRNA levels in the liver were observed for
animals carrying -802FIXml (Figure 5, lanes 2 and 3), while advancing age-dependent
declines in steady-state mRNA level were observed for -416FIXml (Figure 3A, lanes 2 and
3).

To further determine whether the age-dependent decline in the circulatory hFIX levels
was due to an age-dependent decline in transcriptional activity of the transgenes, the effects
of age on hFIX clearance from the circulation were tested as follows. Aliquots of a
plasma-derived hFIX preparation (5 µg/0.1 ml of PBS) were injected via tail vein into normal
animals at 2, 9-10 and 19-23 months of age (n=3 per age group), which have the same
genetic background as the transgenic mice (C57B/6 X SJL). The hFIX level in circulation
was monitored by ELISA of collected blood samples (~50 µl aliquot) at 10 min, 2, 6, 12, 18,
24, 30, 36 and 48 hrs after protein injection. As expected, all animals of different age
groups showed typical bi-phasic clearance kinetics (two compartment distribution and
clearance) with an initial rapid clearance phase (α-phase), followed by a slower clearance
phase (β-phase). The results are shown in Table 1.

Table 1

Clearance Time of Human Factor IX in Mice

Age (months)    Clearance Time (T1/2 of Human Factor IX, hours)
2               16.8 ± 0.21
9-10            17.4 ± 0.55
19-23           16.9 ± 0.35



As shown in Table 1, very similar half clearance times were observed for all age groups
tested. This agreed with our previous results (17.8 hours) for hFIX clearance in a different
strain, BALB/c mice (2 months of age) [Yao et al. (1994) supra].

Furthermore, the results in Table 1 demonstrate that the hFIX turnover time from the
circulation does not change significantly in vivo with increasing age, from youth (2 months),
to middle age (9-10 months) to old age (19-23 months). These results further confirm that
the age-dependent decline in the circulatory hFIX levels was due to an age-dependent decline
in transcriptional activity of the transgenes.
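
The bi-phasic clearance described above is commonly modeled as a sum of two exponentials, C(t) = A·exp(-αt) + B·exp(-βt), with the slow-phase half-life equal to ln 2 / β. The sketch below is an illustration under stated assumptions and is not the fitting procedure used to obtain Table 1, which is not specified here. The data are synthetic, generated from assumed parameters, and time is assumed to be in hours. The slow phase is estimated by a least-squares line through ln(concentration) at the later sampling times, where the fast phase is negligible.

```python
import math


def two_compartment(t: float, a: float, alpha: float, b: float, beta: float) -> float:
    """Bi-exponential clearance curve: fast (alpha) plus slow (beta) phase."""
    return a * math.exp(-alpha * t) + b * math.exp(-beta * t)


def terminal_half_life(times: list, concentrations: list) -> float:
    """Half-life from the least-squares slope of ln(concentration) vs time."""
    n = len(times)
    logs = [math.log(c) for c in concentrations]
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    slope = sxy / sxx
    if slope >= 0:
        raise ValueError("concentration does not decline over the interval")
    return math.log(2) / (-slope)


# Synthetic example (NOT measured data): assumed slow half-life of 17 h,
# assumed fast half-life of 0.5 h, arbitrary amplitudes.
beta = math.log(2) / 17.0
alpha = math.log(2) / 0.5
late_times = [12, 18, 24, 30, 36, 48]  # later sampling times from the text, in hours
synthetic = [two_compartment(t, 3.0, alpha, 2.0, beta) for t in late_times]
estimate = terminal_half_life(late_times, synthetic)
print(round(estimate, 1))
```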

It is important to note that in the in vitro HepG2 cell assay system, the presence or
absence of AE5' in the minigenes did not make any significant difference in hFIX
expression from the hFIX minigenes (Figure 1, and Example 2, supra). In contrast, as
mentioned above in this Example, the presence or absence in vivo of AE5' makes a dramatic
age-dependent difference in hFIX gene expression. This further demonstrates that in vivo
longitudinal analysis is important for studying age-regulation of a gene.

Unlike the animals with -802FIXml, mice with -802FIXml/1.4 (which contains the
complete 3'UTR) showed an advancing age-associated increase in the hFIX level in the
circulation (n=48) (Figure 4A and B). Thus, to determine whether this unexpected
age-dependent increase in the circulatory hFIX level was directly correlated with an increased
level of liver hFIX mRNA, Northern blot analyses of transgenic mice carrying -802FIXml
and -802FIXml/1.4 were conducted. The hFIX mRNA levels in the liver of 1-month
(young) or 15-month (aged) mice carrying -802FIXml (mouse P327 or P552, respectively)
and -802FIXml/1.4 (mouse P32 and P697, respectively) are shown in Figure 5. These
animals were from the same litter produced by the founder F17549 for -802FIXml and
F229 for -802FIXml/1.4 (Figure 4A and B). At the time of sacrifice, P552 and P697 were
expressing 2200 and 1658 ng/ml of hFIX, respectively. The total liver RNA (15 µg from
each animal) was used for the Northern blot analysis performed as described in Figure 3A.
Upper panel: probed with the Ssp I/BamH I fragment of hFIX cDNA; lower panel:
rehybridized with RNR18 (ribosomal RNA) probe. Lane 1: non-transgenic mouse liver; lane
2: transgenic P327 liver RNA; lane 3: transgenic P552 liver RNA; lane 4: transgenic P32
liver RNA; lane 5: transgenic P697 liver RNA. PhosphorImager (Molecular Dynamics) was
used for quantitation of mRNA levels (counts) and ratios of young versus old were
calculated. Young and old animals carrying -802FIXml showed no significant differences in
the mRNA level (the ratio of old over young: 0.92). In contrast, -802FIXml/1.4 animals
showed a substantial elevation in the mRNA level with older age (the ratio of old over
young: 1.54).

These results (Figure 5, lanes 4 and 5) indicated the presence of another important
age-regulatory nucleotide sequence, designated AE3', which is located approximately in the
middle of the 3' UTR, where an extensive stretch of dinucleotide repeating structures is
contained. In the presence of AE5', AE3' clearly confers a crucial age-associated increase in
hFIX expression. This conclusion was further supported by results obtained with
-2231FIXml/1.4 (n=42) (Figure 4D). The unique concerted function conferred by the
combination of AE5' and AE3' was again independent of founder line, initial expression
levels at one month of age, sex, generation, or zygosity status of animals.

Interestingly, animals with sustained high hFIX levels in the circulation
(approximately 1,500 ng/ml or higher) tended to die at a much earlier age than the expected
life span (~2 years) (Figure 4A, B, D). This happened to both males and females, but
appears to be more frequent in males. Without limiting the invention to any particular
mechanism, it is believed that since these transgenic mice have hFIX in addition to their own
mFIX, they may be at an increased risk of lethal thrombosis compared to wild type mice
which do not express the transgenes.

The above-described characterization of transgenic mice harboring hFIX transgenes
demonstrates that (a) while the presence of AE5' in vitro in HepG2 cells did not affect hFIX
gene expression, the presence of AE5' in vivo resulted in a dramatic age-dependent increased
stability in hFIX gene expression, (b) the age-dependent decline in the circulatory hFIX level
observed in animals with -416FIXml, -590FIXml, -679FIXml and -770FIXml is directly
correlated with the decrease in the steady-state mRNA level, which the inventors believe to
be due to an age-dependent decline in the transcriptional activity of the transgenes, and (c)
animals carrying -802FIXml/1.4 showed a substantial elevation in the liver mRNA levels of
hFIX with older age.

EXAMPLE 4 

Footprint And Gel Electrophoretic Mobility Shift Analysis Of The Region From
Nucleotides -665 To -805 of Human Factor IX 

In order to make a preliminary determination of the region within AE5' which is
involved in the function of AE5', footprint analysis and gel electrophoretic mobility shift
assays were performed as follows.

A. Footprint Analysis

For footprint analysis of the region spanning from nt -665 through nt -805, the
fragments used were amplified by PCR with the 32P-labeled 5' primer
ATGGTTAACTGACTTACGAA (nt -833 through -814) (SEQ ID NO:29) and 3' unlabeled
primer GCTCCATTTTGAGTTAATATTTGTGT (nt -657 through -682) (SEQ ID NO:30).
The nuclear extracts (NEs) from HepG2 human hepatoma cells and livers of young (1 month
of age) and old (19 months of age) mice were prepared as previously reported [Kurachi et al.
(1986) Biochemistry 33:1580-1591]. Various amounts of NEs (0, 100 and 150 µg) were
incubated with the labeled fragments (30,000 CPM) for 1 hour on ice and subjected to
DNase I digestion (0.5 unit) for 2 min at room temperature. The samples tested included
those without NEs, with 100 µg and 150 µg of HepG2 cell NEs, with 100 µg and 150 µg
NEs from old mice, and with 100 µg and 150 µg NEs from young mice. Major and minor
footprints and apparent DNase hypersensitivity sites were observed.

Footprint analysis of the region nt -665 through -805 with aged mouse liver nuclear
extracts showed a major footprint (nt -784 through -802), a minor footprint (nt -721 through
-728) and an interesting DNase hypersensitive region (nt -670 through -714). With nuclear
extracts from one month-old animals or HepG2 cells, no such clear footprints were observed.

B. Gel Electrophoretic Mobility Shift Assay

Gel electrophoretic mobility shift assays were performed using mouse liver nuclear extracts
from three different age groups. Nuclear extracts were prepared from 1, 5 or 19 month-old
mouse livers (as described supra). Double stranded oligonucleotides containing a PEA-3
nucleotide sequence spanning from nt -797 to -776 of the hFIX gene
(TTCAGTCGAGGAAGGATAGGGT) (SEQ ID NO:31) were 32P-labeled at the 5' end to a
specific activity of 1.9x10' cpm. Aliquots of the radio-labeled oligonucleotide (20,000 cpm)
were incubated with 10 ng of NEs in the presence of 1 µg of poly dI-dC in DNA binding
buffer for 20 min at room temperature and subjected to polyacrylamide gel electrophoresis
[Kurachi et al. (1986) supra]. In Figure 6A, lane 1: without NEs; lane 2: with NEs of 1
month-old mice; lane 3: with NEs of 5 month-old mice; lane 4: with NEs of 19 month-old
mice; lane 5: with mouse brain NEs (positive control for PEA-3, showing a slightly higher
size of shifted band).

Figure 6B shows the results of the competition assay for 32P-labeled double stranded
oligonucleotides containing the PEA-3 nucleotide sequence. A 100-fold excess of the unlabeled
oligonucleotide described in the preceding paragraph or of a mutant oligonucleotide
[TTCAGTCGGTTGGTGATAGGGT (SEQ ID NO:32) with mutated sequences underlined]
was incubated with 10 µg of 19 month-old mouse liver NEs for 5 min followed by addition
of 32P-labeled oligonucleotides as described supra. Lane 1: without NEs; lane 2: with NEs;
lane 3: with NEs and wildtype competitor; lane 4: with NEs and mutant competitor.
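
Because the underlining that marked the mutated bases does not survive in plain text, the mutated positions can be recovered by comparing the two oligonucleotide sequences directly. The following minimal sketch uses the sequences of SEQ ID NO:31 and SEQ ID NO:32 as quoted above; the function and constant names are assumptions introduced for illustration, and positions are counted from 1 at the 5' end of the oligonucleotide rather than in hFIX gene coordinates.

```python
import re

WILD_TYPE = "TTCAGTCGAGGAAGGATAGGGT"  # SEQ ID NO:31
MUTANT = "TTCAGTCGGTTGGTGATAGGGT"     # SEQ ID NO:32

# Consensus quoted earlier in the text: (C/G)AGGA(A/T)G
PEA3_CONSENSUS = re.compile(r"[CG]AGGA[AT]G")


def differing_positions(a: str, b: str) -> list:
    """1-based positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return [i + 1 for i, (x, y) in enumerate(zip(a, b)) if x != y]


def contains_pea3_site(sequence: str) -> bool:
    """True if the consensus motif occurs anywhere on the given strand."""
    return PEA3_CONSENSUS.search(sequence) is not None


print(differing_positions(WILD_TYPE, MUTANT))  # [9, 10, 11, 12, 13, 14]
print(contains_pea3_site(WILD_TYPE), contains_pea3_site(MUTANT))  # True False
```

The six substituted bases fall within the GAGGAAG site, so the mutant oligonucleotide no longer carries the consensus motif on this strand.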

In agreement with the above results of footprinting, gel electrophoretic mobility-shift
(bandshift) assays showed an increase in protein binding with the nuclear extracts of aged
mice (19 months of age) (Figure 6A). Bandshifts were competitively reduced with excess
amounts of oligonucleotides harboring the PEA-3 motif, but not with oligonucleotides
harboring a mutant PEA-3 motif sequence (Figure 6B), thus confirming the presence of the
PEA-3 motif in AE5'. This is the first time that the PEA-3 protein, which is a member of
the Ets family of transcription factors and which has been shown to bind to nucleotide
sequences [SEQ ID NO:40; SEQ ID NO:48; and SEQ ID NO:84] that are homologous to the
PEA-3 nucleotide sequence within the AE5' region [Karim et al. (1990) Genes & Develop.
4:1451-1453; Nelsen et al. (1993) Science 261:82-86; Fisher et al. (1991) Oncogene
6:2249-2254], has been implicated in such a unique role in age-stable regulation of a gene.

Without limiting the invention to any particular mechanism, the PEA-3 nucleotide
sequence in the hFIX gene appears to have been generated through evolutionary drift of an L1
sequence originally recruited presumably via its retrotransposition into the 5' specific
location. Modern retrotransposable L1 [Kazazian et al. (1988) Nature 332:164-166;
Dombroski et al. (1993) Proc. Natl. Acad. Sci. USA 90:6513-6517; Minakami et al. (1992)
Nucl. Acids Res. 20:3139-3145; Dombroski et al. (1994) Mol. Cell. Biol. 14:4485-4492]
does not have the corresponding PEA-3 nucleotide sequence. The PEA-3 nucleotide
sequence of the AE5' nucleotide sequence resides within the L1-derived sequence retaining a
63-70% similarity with the ORF2 corresponding region of the modern retrotransposable L1 in
the 5' to 3' orientation. Interestingly, the murine FIX gene also has the L1-derived
nucleotide sequence in its 5' end region in an almost identical position as in the hFIX gene,
and has multiple PEA-3 consensus nucleotide elements [Kawarura et al., Organization of
L1 Sequence in the 5' Flanking Region of Factor IX Gene (in preparation)]. Age-regulation
of the murine FIX gene is indeed very similar to that of the hFIX gene [Sweeney and
Hoernig (1993) Am. J. Clin. Pathol. 99:687-688; Kurachi et al. (1996) Thromb. Haemost.
76:965-969], thus providing further insights into the evolutionary origin of the molecular
mechanisms underlying age-associated regulation of the FIX gene.

EXAMPLE 5 

Liver-Specific Expression Of The Exemplary hFIX Gene Under Control Of The hFIX
Promoter

Expression of the natural FIX gene is virtually restricted to the liver [Salier et al.
(1990) J. Biol. Chem. 265:7062-7068]. In order to determine whether any of the upstream
and/or downstream sequences in the hFIX minigenes directed liver-specific expression of the
hFIX transgene, Northern blot analysis was carried out as described supra (Example 3) in
transgenic mice carrying -416FIXml and -802FIXml expression vectors. Animals
expressing hFIX at high level (PA412 and P580 carrying -416FIXml and -802FIXml,
respectively) were sacrificed at one month of age and total RNA was extracted from liver,
lung, intestine, muscle, kidney, brain and heart and from untransfected HepG2 cells (negative
control). The results in transgenic mice carrying -416FIXml and -802FIXml are shown in
Figure 7A and B, respectively.

In Figure 7, the positions of hFIX mRNA, RNR18 (control for RNA loading in
wells), and ribosomal 28S and 18S RNA bands are shown on the left and right sides,
respectively. Animals with -416FIXml/1.4 and -679FIXml showed tissue specific expression
patterns similar to that of -416FIXml (A) (data not shown). Interestingly, liver expression
of hFIX observed for minigenes lacking the region containing AE5' (except -770FIXml,
which remains to be tested as sufficient progeny animals become available) was high, but not
as robust as that seen with the natural gene. In addition, these minigenes were expressed not only
in the liver, but also in other tissues, such as kidney, lung and muscle, at various levels as
high as ~20% of the liver level (Figure 7A). In clear contrast, animals with -802FIXml
showed substantially liver-specific hFIX expression similar to that for the natural FIX gene
(Figure 7B). These results suggest that the AE5' region controls liver specific expression of
hFIX.

An apolipoprotein(a) transcription control region (ACR) which contains an ETS
family target sequence 5'-CCCGGAAG-3' (SEQ ID NO:48) has been shown to exhibit
enhancer activity in vitro in liver-derived HepG2 cells. However, the ACR does not appear
to be liver-specific [Yang et al. (1998), supra].






EXAMPLE 6 
Expression Of PEA-3 Protein In HepG2 Liver Cells 



Expression of the transgene FIX was observed in vivo, but not in vitro in HepG2
cells, when expression vectors containing AE5' were used (See, Examples 2 and 3, supra).
This observation, together with the absence of a footprint in HepG2 cell NEs (See, Example
4, supra) suggested to the inventors that HepG2 cells' lack of expression of the FIX
transgene may be a result of the cells' expression of low levels of the PEA-3 protein (and/or
the PEA-3 related protein) which binds to homologs of the invention's PEA-3 nucleotide
sequence. The complete human PEA-3 cDNA has not yet been cloned (the human PEA-3
cDNA sequence of GenBank accession number U18018 lacks 8 amino acids at the
N-terminal region when compared to the mouse PEA-3 cDNA sequence). In order to
determine the role of the PEA-3 nucleotide sequence in gene expression in vitro, HepG2
cells which overexpress mouse PEA-3 protein were constructed as follows.

Expression constructs containing the mouse PEA-3 cDNA sequence (GenBank
Accession Number X63190; Figure 9) were constructed as follows. Using the reported
mouse PEA-3 cDNA sequence, three sets of PCR primers were synthesized such that the
entire coding region and parts of the flanking sequences would be amplified. Reverse
transcription PCR (RT-PCR) was carried out, and the amplified mouse PEA-3 cDNA
sequence was inserted into an expression vector under the control of the SV40 promoter,
which does not interfere with the factor IX promoter (data not shown).

The PEA-3 expression vector is used to transfect HepG2 cells using FuGene 6
(Boehringer Mannheim) since this method was shown to improve transfection efficiency (See,
Example 2, supra). Transfected HepG2 cells are screened for expression of PEA-3 by
Northern blot analysis and/or Western blot analysis using commercially available antibody.
Transfected HepG2 cell lines which stably express PEA-3 protein are selected for further
use, e.g., to analyze the underlying mechanism of PEA-3 action.

EXAMPLE 7

In Vitro And In Vivo Expression Of Exemplary Human Protein C Minigene Expression
Vectors Containing AE5' And AE3'

Protein C is a factor which plays a critical role in the anti-blood coagulation
mechanism. Unlike factor IX, whose level in the circulation substantially increases with
advancing age, protein C levels in the circulation do not increase with advancing age, but
rather show a slight decrease over time. This decrease in circulating protein C levels is
believed by the inventors to be the result of regulation at the gene transcription level. For
this reason, the protein C gene provides an interesting exemplary gene for demonstrating the
universality of the AE5' and AE3' function in gene expression both in vitro and in vivo as
follows. In this Example, bases are numbered relative to the major transcription start site
(+1) as previously described [Miao et al. (1996) J. Biol. Chem. 16:9587-9594].

A. Construction of Human Protein C Minigene Expression Vectors 

The human protein C genomic sequence has been previously reported (GenBank
accession number M11228; Figure 12B). Using this sequence, three protein C minigene
expression vectors were prepared. The first human protein C minigene vector (-1426PCml)
contained the human protein C promoter region of the protein C gene (GenBank accession
number M11228; Figure 12B) ligated to the human protein C cDNA (GenBank accession
number X02750; Figure 12A) which contains the entire first intron and poly-A sequence.
The second human protein C minigene (AE5'/-1426PCml) was the same as the first vector
except that it additionally contained the nucleotide sequence AE5' at the 5' end of the
human protein C cDNA. The third human protein C minigene (AE5'/-1426PCml/AE3') was
the same as the first vector except that it additionally contained the nucleotide
sequences AE5' and AE3' at the 5' and 3' ends, respectively, of the human protein C
cDNA.

B. Transient Expression Of Human Protein C In Vitro In HepG2 Cells

Each of the protein C minigene expression vectors was transfected into HepG2 cells
using FuGene 6 (Boehringer Mannheim). Four to five independent assays of human
protein C activity were carried out as previously described [Turkey et al. (1999) Thromb.
Haemost. 81:727-732]. HepG2 cells transfected with the -1426PCml vector showed in vitro
transient protein C activities which were comparable to the activities shown by HepG2 cells
transfected with the AE5'/-1426PCml vector (i.e., 60-70 ng/10E6 cells/24 hrs).

C. Generation Of Transgenic Mice Harboring The Protein C Minigene
Expression Vectors

In order to determine whether AE5' in combination with AE3' is capable of
increasing human protein C expression with advancing age, as observed for factor IX
expression (Example 3, supra), transgenic mice which harbor the protein C minigene
expression vectors are generated according to standard methods [Hogan et al. (1994), supra].
Briefly, protein C minigene vector plasmids are injected into fertilized eggs of C57B/6 X
SJL mice and implanted into foster mother animals (CD-1). Offspring produced are
screened for founder animals with high transgene copy numbers (5-10 copies per genome)
using quantitative multiplex PCR analyses of tail tissue DNA samples using two pairs of
primers which are designed to amplify a unique fragment from the protein C transgenes and
a 494 bp fragment from the mouse β-globin gene, respectively.

Founders are back-crossed with non-transgenic mice (C57BL/6 X SJL) to generate F1
progeny animals. Homozygous F2 animals are generated by crossing among heterozygous
F1 littermates and the following generations are similarly generated. Zygosity status of
animals is determined by quantitative multiplex PCR analysis as described above. Founder
lines for each minigene construct are subjected to longitudinal analysis for their entire life
spans up to two years.
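
The breeding scheme above can be illustrated with a short sketch of the expected genotype
frequencies. It assumes, for illustration only, a single transgene integration site that
segregates in a simple Mendelian fashion; the allele labels "T" (transgene) and "w" (no
transgene) and the function name are hypothetical and do not appear in the original text.

```python
from collections import Counter
from fractions import Fraction
from itertools import product


def cross(parent_a, parent_b):
    """Expected genotype frequencies for a single-locus cross.

    Each parent is a two-character string of alleles, e.g. "Tw" for a
    heterozygous carrier (assumption: one integration site, Mendelian
    segregation). Genotypes are reported with alleles sorted, so "Tw"
    and "wT" are counted together.
    """
    counts = Counter("".join(sorted(pair)) for pair in product(parent_a, parent_b))
    total = sum(counts.values())
    return {genotype: Fraction(n, total) for genotype, n in counts.items()}


# Founder (heterozygous) back-crossed with a non-transgenic animal gives F1 progeny.
f1 = cross("Tw", "ww")
# Crossing heterozygous F1 littermates gives F2 progeny.
f2 = cross("Tw", "Tw")

print(f1)
print(f2)
```

Under these assumptions, one quarter of the F2 progeny are expected to be homozygous for
the transgene, which is why zygosity still has to be confirmed for each animal by the
quantitative multiplex PCR analysis described above.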

Circulatory human protein C levels are monitored during longitudinal analyses of
transgenic mice from the representative founder lines carrying the protein C minigene
transgenes. Age-regulation patterns of circulatory human protein C levels are compared
among all animals for each specific construct, different founder lines, different initial human
protein C level at pre-pubertal age (one month) due to transgene positional effects,
generation (founders and F1 or F2 progeny), sex, and zygosity (homozygous / heterozygous)
status of the transgenes.

Northern blot analyses of the liver RNA samples from animals are carried out using
stringent washing conditions to determine whether any changes in circulatory human protein
C levels are correlated with similar changes in the steady-state liver human protein C
mRNA, rather than with loss of the human protein C transgene with age or with changes in
human protein C turnover time. Observation of transgenic animals which contain the human
protein C sequence as well as AE5' and AE3' and which increase stable circulatory human
protein C levels with increasing animal age, as compared to the levels in transgenic animals
which express the human protein C sequence in the absence of AE5' and AE3', demonstrates
that the combination of AE5' and AE3' functions in age-stable expression of the exemplary
human protein C gene. This observation will confirm that these results may be achieved for
genes other than the exemplary hFIX and protein C genes.

EXAMPLE 8

In Vivo Expression Of Exemplary Expression
Vectors Containing The Cytomegalovirus (CMV) Promoter
And The AE5' And AE3'

This Example is carried out to demonstrate the universality of the age-related gene
expression regulatory function of AE5' and AE3' with viral promoters. The CMV promoter
is currently used in several gene therapy constructs but its activity decreases with time in the
liver. Furthermore, the activity of the CMV promoter in the liver of transgenic mice is
known to be lower than the activity in other tissues, such as muscle. Thus, this Example
investigates whether the combination of AE5' and AE3' halts or reverses the decline in the
activity of the CMV promoter in the liver.

The above-described -416FIXml expression vector (Example 1, and Figure 1) is used
to construct a control vector to determine the effect of AE5' and AE3' on liver and
circulatory levels of expression of human factor IX in transgenic animals. The control
vector in which expression of the hFIX gene is under the control of the CMV promoter in
the absence of both AE5' and AE3' is constructed by replacing the human factor IX
promoter sequence with the CMV promoter sequence (National Vector Core for Non-Viral
Vectors at the University of Michigan) (the CMV promoter is also located between positions
+1 to +596 in vector plasmid pCR3 from Invitrogen). The resultant expression vector in
which the human factor IX gene is under the control of the CMV promoter is transfected
into HepG2 cells. Transfected cells are expected to show human factor IX activity.

The -802FIXml/1.4 vector of Figure 1 is used to construct a test vector in which the
human factor IX promoter sequence of the -802FIXml/1.4 vector (which contains both AE5'
and AE3') is replaced with the CMV promoter sequence.

In order to determine whether the combination of AE5' and AE3' is capable of
increasing human factor IX expression with advancing age under the control of the CMV
promoter, as observed for factor IX expression under the control of the factor IX promoter
(Example 3, supra), transgenic mice which harbor either the control vector or the test vector
are generated according to standard methods as described supra (Examples 3 and 7). The
mRNA levels of human factor IX in the blood, liver and other tissues are monitored during
longitudinal analyses of the transgenic mice. Age-regulation patterns of human factor IX
mRNA levels in the different tissues are compared among all animals as described supra for
each specific construct, different founder lines, different initial human factor IX levels at
pre-pubertal age due to transgene positional effects, generation, sex, and zygosity status of the
transgenes.

The observation of transgenic animals which contain the test vector and which
increase stable circulatory human factor IX mRNA levels, as compared to the circulatory
mRNA levels in transgenic animals which contain the control vector, demonstrates that the
combination of AE5' and AE3' functions in increasing the activity of the exemplary CMV
promoter.

EXAMPLE 9

Liver-Specific Expression Of The Exemplary Human Factor IX Gene Under The
Control Of The CMV Promoter

This Example investigates whether the presence of AE5' imparts liver-specific
activity to the CMV promoter, which otherwise drives gene expression in several tissues in
addition to the liver.

The -802FIXml vector which contains AE5' and lacks AE3' (Figure 1) is used to
construct a test vector in which the human factor IX promoter sequence of the -802FIXml
vector is replaced with the CMV promoter sequence. This test vector is used in parallel
experiments with the control vector of Example 8 in which expression of the hFIX gene is
under the control of the CMV promoter in the absence of both AE5' and AE3'. Northern
blot analysis is carried out as described supra (Example 3) in transgenic mice carrying the
control vector or test vector. Animals expressing hFIX at high levels are sacrificed at one
month of age and total RNA is extracted from liver, lung, intestine, muscle, kidney, brain
and heart and from untransfected HepG2 cells (negative control). The levels of hFIX mRNA
in the different tissues are compared. It is expected that transgenic animals harboring the
control vector will express hFIX mRNA in liver as well as in at least one other tissue. In
contrast, the observation that transgenic animals which harbor the test vector express hFIX
mRNA in the liver and not in other tissues indicates that AE5' confers liver-specific activity
to the exemplary CMV promoter.

From the above, it is clear that the invention provides methods for age-related and 
liver-specific gene expression and models for age-related and liver-specific diseases. 


EXAMPLE 10

Construction Of A Series Of Exemplary Human Protein C
Minigene Expression Vectors

This Example and the following Examples 11-12 were carried out to further
demonstrate the universality of the function of AE5' and AE3'' sequences with respect to
regulating expression of the exemplary human protein C as described in Example 7, supra.
The human protein C genomic sequence has been previously reported (GenBank accession
number M11228; Figure 12B). In particular, Figure 14 shows the nucleotide sequence (SEQ
ID NO:85) of the 5'-end of the human protein C gene [Miao et al. (1996) J. Biol. Chem.
16:9587-9594]. In Figure 14, bases are numbered relative to the major transcription start site
(+1) as previously described [Miao et al. (1996) J. Biol. Chem. 16:9587-9594]. Two minor
start sites are marked with double asterisks. Exons are underlined. The translation start
codon (ATG) is shown in boldface.

Using this sequence, eight protein C minigene expression vectors were prepared as
shown in Figure 15. Fat lines at 5' and 3' ends represent flanking sequences. Introns are
shown by blank rectangles. Exons are shown by shaded rectangles (corresponding to the 3'
UTR part of the last exon). Arrows indicate transcriptional start sites. The vectors
contained the human FIX AE5' sequence (SEQ ID NO:1; nt -802 through -771 of Figure 8;
32 bp in size) linked to Sph I linker sequences and/or the human FIX AE3'' sequence (SEQ
ID NO:93; nt 32,110 through nt 32,263 of Figure 8, 154 bp long) linked to Sse8387 I linker
sequences. All the amplified sequences of the human protein C minigene vectors were
verified by dideoxy sequencing.

The first human protein C (hPC) minigene vector (-1462hPCml) was composed of
the 5' flanking sequence of human protein C up to nt -1462, exon I sequence, the complete
first intron at the natural site (1431 bp in length) and the contiguous following sequence
containing exons 2-9 (derived from the hPC cDNA) and the 3' immediate flanking genomic
sequence through nt 11,108 (325 bp in length). As shown in Figure 15, nucleotide 67 was
the first base of intron I, nucleotide 1,497 was the first base of exon 2 (the first base of the
Met codon is nucleotide 1,514), and nucleotide 10,488 was the last base of the stop codon.

To construct -1462hPCml, a region spanning nt -1462 through 1,560 (3,022 bp in
length) of the hPC gene was amplified by PCR (Expand High Fidelity PCR System,
Boehringer Mannheim) using 5' and 3' primers containing an Sph I linker and the unique
internal Msc I site in exon 2, respectively, and human genomic DNA as the template.
The generated fragment, containing the 5' flanking sequence, exon I, intron I and a short 5'
portion of exon 2 to the internal Msc I site, was then inserted into an hPC cDNA plasmid,
pUC119-hPC, in between the Sph I and Msc I sites by replacing its 5' portion of the hPC cDNA
sequence. The 3' end of the resulting minigene, -1462hPCm', contained the entire 3' UTR,
but only up to the poly(A) attachment site (nt 10,783). Its 3' end region (the 3' sequence
beyond the internal Sse8387 I site in the 3' UTR) was then freed by Sse8387 I/EcoR I
double digestion, and replaced with an Sse8387 I/EcoR I fragment (612 bp in length,
spanning nt 10,497 through 11,108 of the hPC gene), which was generated by PCR. The hPC
minigene thus constructed was named -1462hPCml (approximately 4981 bp in length) and
served as a parent construct for generating other hPC minigene constructs as described
below. This hPC minigene vector was used to
construct the first transgenic mouse colony, which was used as a positive control for hPC
expression.
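
The fragment lengths quoted above follow from the nucleotide coordinates. The sketch below
is an illustrative check only; it assumes that numbering runs ..., -2, -1, +1, +2, ... with no
position 0, an assumption that is consistent with the stated 3,022 bp length of the nt -1462
through 1,560 region. The function name span_length is hypothetical.

```python
def span_length(start, end):
    """Length in bp of an inclusive nucleotide span.

    Assumption: positions are numbered relative to the transcription start
    site (+1) and there is no position 0, so a span that crosses the start
    site is one base shorter than plain arithmetic would suggest.
    """
    if start == 0 or end == 0:
        raise ValueError("position 0 does not exist under this numbering")
    if start > end:
        raise ValueError("start must not exceed end")
    length = end - start + 1
    if start < 0 < end:
        length -= 1  # correct for the missing position 0
    return length


print(span_length(-1462, 1560))    # amplified 5' region of the hPC gene
print(span_length(-802, -771))     # hFIX AE5' element
print(span_length(32110, 32263))   # hFIX AE3'' element
print(span_length(10497, 11108))   # Sse8387 I/EcoR I replacement fragment
```

With these inputs the sketch gives 3022, 32, 154 and 612 bp, matching the lengths stated in
the text.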

The second human protein C minigene vector (-82hPCml) was another control
hPC minigene. It was the same as -1462hPCml, except that its promoter region was
extended only up to nt -82 instead of to nt -1462, and thus contained the hPC upstream
sequence (SEQ ID NO:86) from nt -82 to nt +1. Minigene -1462hPCml was subjected to
Sph I digestion, followed by partial Msc I digestion, releasing the 5' half region spanning nt
-1462 in the 5' upstream through nt 1,547 at the internal Msc I site in exon 2. Due to
another Msc I site in the first intron, partial digestion was needed to obtain the required 5' end
half fragment. This region was then replaced by smaller Sph I/Msc I fragments generated by
PCR, spanning nt -82 in the 5' upstream through the internal Msc I site, thus generating
-82hPCml.

The third human protein C minigene vector (AE5'/-1462hPCml) was the same as
-1462hPCml, except that it had an AE5' sequence (32 bp in length) inserted at the 5' end at
an Sph I site.

The fourth human protein C minigene vector (-1462hPCml/AE3'') was the same as
-1462hPCml, except it had an AE3'' sequence inserted at the Sse8387 I site within the 3'
untranslated region (UTR) of the hPC minigene.

The fifth human protein C minigene vector (AE5'/-1462hPCml/AE3'') was the same
as -1462hPCml, except it had both AE5' and AE3''. The AE5' fragment (32 bp in length,
spanning nt -802 through -771 of the hFIX gene) was amplified by PCR (Expand High Fidelity
PCR System, Boehringer Mannheim) with human genomic DNA as template and PCR
primers containing Sph I linker sequences. The AE3'' fragment spanning nt 32,110 through
nt 32,263, and containing the potential stem-loop structure forming region (nt 32,142 through
32,243) was produced by PCR using primers containing the internal Sse8387 I site
sequences. AE5' and AE3'' fragments with Sph I or Sse8387 I sticky ends, respectively,
were then inserted into -1462hPCml at the Sph I site at the 5' end and the Sse8387 I site in the 3'
UTR, respectively, thus generating minigene AE5'/-1462hPCml/AE3''. Therefore,
AE5'/-1462hPCml/AE3'' is the same as -1462hPCml except it has both AE5' and AE3'
sequences of the hFIX gene at the 5' end Sph I site and Sse8387 I site in the 3' UTR,
respectively.

The sixth and seventh human protein C minigene vectors (-849hPCml and
-802hPCml) were the same as -1462hPCml, except that their 5' end sequences extended to
nt -849 and -802, respectively, instead of to -1462. Construction of these minigenes was
essentially the same as construction of the second human protein C minigene (-82hPCml);
minigene -1462hPCml was subjected to Sph I digestion, followed by partial Msc I digestion,
releasing the 5' half region spanning nt -1462 in the 5' upstream region through nt 1,547 at
the internal Msc I site in exon 2. Due to another Msc I site in the first intron, partial
digestion was needed to obtain the required 5' end half fragment. This region was then replaced
by smaller Sph I/Msc I fragments generated by PCR, spanning nt -849 or -802 in the 5'
upstream through the internal Msc I site, thus generating -849hPCml and -802hPCml,
respectively.

The eighth human protein C minigene vector (AE5'/-82hPCml) was generated by
inserting the AE5' sequence with Sph I sticky ends into the second vector, -82hPCml, at the
5' end Sph I site.



EXAMPLE 11

Transient Expression Of Exemplary Human Protein C Minigene
Expression Vectors In Vitro In Human Hepatoma HepG2 Cell Line

Transient expression activities of the hPC minigenes of Example 10 were tested with
HepG2 cells as described in Example 2, supra. Figure 16 shows the relative in vitro
transient expression activities and standard deviations of the human protein C minigene
expression constructs relative to the activity (approximately 70 ng/10^6 cells/48 hr, defined as
100% activity) of the control -1462hPCml construct.
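
The normalization to the control construct can be sketched as follows. This is an
illustrative calculation only: the function names are hypothetical, and the measured value
of 49 ng/10^6 cells/48 hr used for a test construct is a made-up number chosen for the
example, not data from Figure 16.

```python
def relative_activity(measured, control):
    """Activity of a construct as a percentage of the control activity.

    Both values must be in the same units (e.g. ng/10^6 cells/48 hr);
    the control is defined as 100%.
    """
    if control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * measured / control


def percent_suppression(with_element, without_element):
    """Percent reduction in activity caused by adding a regulatory element."""
    return 100.0 - relative_activity(with_element, without_element)


CONTROL_ACTIVITY = 70.0       # approximate control value stated in the text
HYPOTHETICAL_ACTIVITY = 49.0  # invented value for a test construct

print(relative_activity(HYPOTHETICAL_ACTIVITY, CONTROL_ACTIVITY))
print(percent_suppression(HYPOTHETICAL_ACTIVITY, CONTROL_ACTIVITY))
```

Expressing every construct as a percentage of the same control allows constructs assayed in
independent transfections to be compared on one scale.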

The relative transient expression activities in Figure 16 show that the results obtained
with AE3'' when in tandem with the hPC gene are fully consistent with what the inventors
previously observed with AE3'' when in tandem with the hFIX gene (Figure 1); the
presence of AE3'' showed approximately a 30% suppression in transient expression in
comparison to the minigenes which lacked AE3'' (i.e., -1462hPCml/AE3'' compared with
-1462hPCml, and AE5'/-1462hPCml/AE3'' compared with AE5'/-1462hPCml). The
control constructs -1462hPCml and -82hPCml showed similar transient expression activities
to each other.

As discussed above, these results were surprising because they were contrary to those
previously reported by Miao et al. (1996), supra, when using a heterologous reporter gene,
chloramphenicol acetyltransferase (CAT), under the transcriptional control of varying lengths
of the protein C 5'-end sequences.

EXAMPLE 12

Generation And Analysis Of Transgenic Mice Harboring Human Protein C
Minigene Expression Vectors

Transgenic animals were constructed using each of the eight expression plasmids
described above in Example 10 according to standard methods [Hogan et al. (1994) in
"Manipulating the Mouse Embryo, a Laboratory Manual" (Cold Spring Harbor Press, New
York, 2nd Edition)] as described in Example 3, supra. Fertilized eggs of mice were
microinjected with the minigene transgene DNA and implanted into foster mother animals.

Circulatory hPC levels were monitored during longitudinal analyses of transgenic
mice harboring the -1462hPCml, -82hPCml, and AE5'/-1462hPCml/AE3'' minigene
transgenes. At various ages, starting at one month of age, transgenic mice were individually
subjected to blood sample collection via tail-tip snipping, and the obtained serum was
routinely used to quantify hPC levels in the circulation using ELISA for each age point.
The ELISA assay employed a mouse monoclonal anti-hPC antibody (Celsus Laboratories) as
a first antibody, and a rabbit polyclonal anti-hPC antibody (Celsus Laboratories) as a second
antibody. Pooled human plasma (George King Bio-Medical) was used to prepare an hPC
standard curve for each assay. The results are shown in Figure 17. The labeling in Figure
17 reflects the tag numbers of animals containing each minigene construct. Figure 17 shows
representative animals with -1462hPCml (A), -82hPCml (B), and AE5'/-1462hPCml/AE3''
(C) expression vectors.
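
Reading a sample concentration off a standard curve can be sketched as below. The text
does not state how the standard curve was fitted, so this example swaps in simple linear
interpolation between neighbouring standards as a stand-in; the standard concentrations,
optical densities and the function name are all hypothetical.

```python
from bisect import bisect_left

# Hypothetical standards: (hPC concentration in ng/ml, optical density).
HYPOTHETICAL_STANDARDS = [(0.0, 0.05), (50.0, 0.30), (100.0, 0.55), (200.0, 1.05)]


def interpolate_concentration(od, standards):
    """Estimate concentration from an optical density reading.

    Uses linear interpolation between the two standards that bracket the
    reading (a simplification chosen for this sketch). Returns None when
    the reading lies outside the range covered by the standards, since
    extrapolation would be unreliable.
    """
    points = sorted(standards, key=lambda p: p[1])
    ods = [p[1] for p in points]
    if od < ods[0] or od > ods[-1]:
        return None
    i = bisect_left(ods, od)
    if ods[i] == od:
        return points[i][0]
    (c0, o0), (c1, o1) = points[i - 1], points[i]
    return c0 + (c1 - c0) * (od - o0) / (o1 - o0)


print(interpolate_concentration(0.425, HYPOTHETICAL_STANDARDS))
print(interpolate_concentration(2.0, HYPOTHETICAL_STANDARDS))
```

Preparing a fresh standard curve for each assay, as described above, means each plate is
interpreted against its own standards rather than against values carried over from an
earlier run.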

Importantly, Figure 17 shows that age-regulation patterns were remarkably similar
among all animals for each specific construct. In particular, the results show that transgenic
animals containing the -1462hPCml construct contained age-stable levels of human protein
C, i.e., the animals expressed relatively constant levels of human protein C at different time
points during the life span of the transgenic animals (Figure 17A). In direct contrast, the
presence of AE5' and AE3' sequences resulted in increased expression levels of human
protein C over time (Figure 17C). These results confirm the universality of the function of
AE5' and AE3'' sequences in regulating expression of operably linked genes in an
age-related manner.

The data also show that, whereas transgenic animals containing the -1462hPCml
construct exhibited relatively constant and relatively high levels (from about 100 to about
3000 ng/ml) of human protein C over time (Figure 17A), in dramatic contrast, transgenic
animals containing the -82hPCml construct exhibited relatively low levels (from about 5 to
about 40 ng/ml at 1 month of age) of human protein C which declined at a precipitous rate
over time. Indeed, by the age of 5 months, human protein C levels were undetectable in all
transgenic animals harboring the -82hPCml construct. These results demonstrate that the
nucleotide sequence from nt -1462 to nt -83 of the human protein C gene directs age-stable
expression as well as relatively higher levels of expression (as compared to the levels in the
absence of the nucleotide sequence from nt -1462 to nt -83) of operably linked sequences of
interest.
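
One way to put the terms "age-stable", "increasing" and "declining" on a numerical footing
is to fit a straight line to log-transformed levels against age. The sketch below is an
assumption-laden illustration and not the analysis used in this Example: the tolerance of
0.02 log10 units per month, the function names and all data values are hypothetical.

```python
import math


def log_slope(ages_months, levels_ng_ml):
    """Least-squares slope of log10(level) versus age, in log10 units per month."""
    n = len(ages_months)
    if n < 2 or n != len(levels_ng_ml):
        raise ValueError("need at least two paired observations")
    ys = [math.log10(v) for v in levels_ng_ml]  # levels must be > 0
    mean_x = sum(ages_months) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in ages_months)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(ages_months, ys))
    return sxy / sxx


def classify_age_pattern(ages_months, levels_ng_ml, tolerance=0.02):
    """Label a longitudinal series using a hypothetical slope tolerance."""
    slope = log_slope(ages_months, levels_ng_ml)
    if slope > tolerance:
        return "increasing"
    if slope < -tolerance:
        return "declining"
    return "age-stable"


# Invented series for illustration only (ages in months, levels in ng/ml).
print(classify_age_pattern([1, 3, 6, 12], [1000, 1100, 950, 1050]))
print(classify_age_pattern([1, 3, 6, 12], [100, 200, 400, 800]))
print(classify_age_pattern([1, 2, 3, 4], [40, 20, 10, 5]))
```

Because the logarithm is undefined at zero, time points at which the protein is
undetectable would have to be excluded or replaced by the assay detection limit before
using a calculation of this kind.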
1. A recombinant expression vector comprising in operable combination i) a
nucleic acid sequence of interest, ii) a promoter sequence, and iii) one or more age
regulatory sequences selected from SEQ ID NO:1, SEQ ID NO:3, a portion of SEQ ID
NO:1, and a portion of SEQ ID NO:3.

2. A method, comprising: 

a) providing: i) a cell, ii) a nucleic acid sequence of interest, iii) a
promoter sequence, and iv) one or more age regulatory sequences selected from SEQ ID
NO:1, SEQ ID NO:3, a portion of SEQ ID NO:1, and a portion of SEQ ID NO:3;

b) operably linking said nucleic acid sequence of interest, said promoter
sequence, and said one or more age regulatory sequences to produce a transgene; and

c) introducing said transgene into said cell to create a treated cell under 
conditions such that said nucleic acid sequence of interest is expressed in said treated cell. 

3. A substantially purified nucleic acid sequence comprising at least a portion of 
SEQ ID NO:93. 

4. The nucleic acid sequence of Claim 1, wherein said portion has age-related 
regulatory activity. 

5. The nucleic acid sequence of Claim 1, wherein said portion is selected from
SEQ ID NO:91, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID
NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID
NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID
NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID
NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID
NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, and SEQ ID NO:144.



6. The nucleic acid sequence of Claim 3, wherein said portion is SEQ ID NO:91.

7. A recombinant expression vector comprising in operable combination i) a
nucleic acid sequence of interest, ii) a promoter sequence, and iii) an age-related regulatory
sequence selected from SEQ ID NO:93 and portions thereof.

8. The expression vector of Claim 7, wherein said nucleic acid sequence of
interest encodes a protein selected from factor VIII, factor VII, factor IX, factor X,
prothrombin, protein C, antithrombin III, tissue factor pathway inhibitor, LDL-receptor,
human α1-antitrypsin, antithrombin III, PEA-3 protein, β-galactosidase, and luciferase.

9. The expression vector of Claim 7, wherein said promoter sequence is selected
from human factor IX promoter, cytomegalovirus promoter, tRNA promoter, 5S rRNA
promoters, histone gene promoters, RSV promoter, retrovirus LTR promoter, SV40
promoter, PEPCK promoter, MT promoter, SRα promoter, P450 family promoters, GAL7
promoter, T7 promoter, T3 promoter, SP6 promoter, K11 promoter, and HIV promoter.

10. The expression vector of Claim 7, wherein said portion of SEQ ID NO:93 is
selected from SEQ ID NO:91, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID
NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID
NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID
NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID
NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID
NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, SEQ ID NO:121, SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, and SEQ ID NO:144.

11. The expression vector of Claim 10, wherein said portion is SEQ ID NO:91.



12. The expression vector of Claim 7, further comprising in operable combination 
an age-related regulatory sequence selected from SEQ ID NO:l and portions thereof. 

13. A host cell containing the recombinant expression vector of Claim 7. 

14. The host cell of Claim 13, wherein said host cell is comprised in a tissue or 
organ in a living animal. 

15. The host cell of Claim 13, wherein said host cell is a gamete. 

16. The host cell of Claim 13, wherein said host cell is selected from bacterial 
cell, yeast cell, plant cell, insect cell, and mammalian cell. 

17. A method for expressing a nucleic acid sequence of interest, comprising: 

a) providing: 

i) a cell; 

ii) a nucleic acid sequence of interest; 

iii) a promoter sequence; and 

iv) an age-related regulatory sequence selected from SEQ ID NO:93 and 
portions thereof; 

b) operably linking said nucleic acid sequence of interest, said promoter 
sequence, and said age-related regulatory sequence to produce a transgene; and 

c) introducing said transgene into said cell to create a treated cell under 
conditions such that the nucleic acid sequence of interest is expressed in said treated cell. 

18. The method of Claim 17, wherein said treated cell is comprised in a tissue or 
organ in a living animal. 
19. A substantially purified nucleic acid sequence comprising a nucleotide
sequence selected from at least a portion of SEQ ID NO:85, and at least a portion of SEQ
ID NO:92.



20. A method for expressing a nucleic acid sequence of interest, comprising: 

a) providing: 

i) a cell; 

ii) a nucleic acid sequence of interest; 

iii) a promoter sequence; and 

iv) a nucleotide sequence having activity selected from age-related
regulatory activity and regulatory activity, said nucleotide sequence selected from SEQ ID
NO:92, a portion of SEQ ID NO:92, SEQ ID NO:85, a portion of SEQ ID NO:85, SEQ ID
NO:89, and SEQ ID NO:90;

b) operably linking said nucleic acid sequence of interest, said promoter 
sequence, and said nucleotide sequence to produce a transgene; and 

c) introducing said transgene into said cell to create a treated cell under 
conditions such that the nucleic acid sequence of interest is expressed in said treated cell. 
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Figure 8A 
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TCGCACATCT 
AGCCCTTAGA 
GACAGTGTTA 
AAAAAATTTC 

AAAAAAtAGG 
AAAATTCTAC 
TA6AATATAT 
CATmGCTA 
TGAAGATGTG 
TCACnCTCC 



CAGCAAACXC 
ATTCACAATT 
ATAAAAGACG 
TCAATGCCAT 
GTCAA9CCTA 
CAAAACAGA6 
AATGGGGAAA 
AATTAATTCA 
TTGGACTTCA 
CTACCATCAG 
ACAAATTTAC 
TGAAAAAATG 
AGGAAGCAAC 
CGATTCCTCA 
ATGCACACCT 
ATATACACCT 
C CCAACGACA 
CCTGTTGTGG 
ATACATATCr 
ACAGTTAAAA 

AGAAC CCATT 
GTTT6ACCTA 
AAAAAAA CTA 
VCTGT6AAA6 
AAACAAOCR 
AACAVCATK 
TCAC1T0CT6 
CAACACCCCA 



TCACGATAAC 
GCTTCAAAGA 
ATACAAACAA 
CCCCATCAAG 
AGCCAAAAGA 
ATACAGACCA 
OGAATmiA 
AGATGGATTA 
tCrCTAAAAC 
AGTGAACAGG 
AGAAAAAAAC 
CTCACCATCA 
AGGTGCTGGA 
GGGATCTAGA 
Alt^TU'fATTG 
AGGAATACTA 
AAAAACCAAA 
GGTGGGGGGC 
AACAAACCTG 
ATATCTTACC 
GGCATAAATA 
CAG1CGAGGA 

TCAGGAtCTA 
AOCTAATTAA 
AACAATCT GA 
ctctctgaca 
TTTCAGACTC 
TTGGAAATAG 



AAAAtCAATG 
GAATAAAATA 
ATCGAAGAAC 
CTACCAATGA 
ACAAAGCTG6 
ATCGAA C A CA 
ATTAATAAAT 
AAGACTKAT 
ACCAAAACCA 
CAACCTAAAC 
AAACAACCCC 
CCGGCCATCA 
GAGGATGTGG 
CCTAGAAATA 

CGGCAcrrrr 

GGCAGCCATA 
CACeoCATGT 
GAGGGGACCC 
CTCGTTCrOC 
AAAACAACAA 
CA1CAATGGC 
AGGATAGGGT 



GTGTTACTGG 
GAGCACAAA A 
CAGTAAAAAf 
AAGATACGGT 
AAATCACCCA 
9CCAAAGAi£c 



TGCAAAAATC • 

CCTAC6AATC < 

ATTACAtCCT < 

CWTCTTCAC . 

AGGCATCATG - 
ACA6A0CCCT ^• 
GGTGCTGGGA 

TGITACACCr < 

ATCGCAACAA - 

AAtCGGAGAA • 

ATCAACAAGT < 

CAGAAATGCA * 

AGAAATAGGA < 

CCATCTGACC • 

CACAATAGCA • 

AAAAGAAAAT ' 

tCTCACXCAT • 

ATAGCAtTAfi' < 

M:ATGrACCC • 
tJiXAGTTGGA 

GGTATTGTtG 
9ATTAACTCA 
AAGAGTTCTA 
CACAAGCtAC 
AAAAAATCAG 
CCGTCCCACT 
GAGTGGCAGA 
CATTGAGGGA 

net 
GGTT ATS 



GATGCACATT ATITCCCAGA AGTAAATACA GCTCAGCTTG TACmGGTA CAACTAATCG ACCTTACCAC TTTCACAATC TGCTAGCAAA 

Gin Arg Acn Net Zl* Met Ale Glu Ger Pre Gly Leo He Thr llm Cye Leu Leu Cly Tyr Len Lee See Ale Gin Clrs 
CAG CGC GTG AAC ATG ATC ATG GCA GAA TCA CCA GGC CTC ATC ACC AtC TCC CTT TXA CCA TAT CT A CK ACT G» m 

Ths 

ACA G^OTTT GmCCTm TTAAAATACA TTGAGTATGC TTC CC TTTTA GATATAGAAA TATCTGATGC IGACr X CHC ACTAAATXTT GATTACATGA 



TTTCACACCA 
TAGATCGACA 
TAAAAAAATT 
CGAAACAAAA 
TA ATllfl TA 
GATTAGGAAA 
OCTTAAGAAT 
CXTACAGGGA 
TGACAGTAOC 
CTCAAATGCT 
AOGACATAAA 
G6TTGTAA6A 
CTAGCGATAA 
CCCTTGAGGA 
OGOTGAGTOA 
CACmAGTT 
ArOGGGATAA 
ATAGAGAAAC 
CTGATATAAG 
ACTATGACAA 
CttAGTATAA 
CTACCACAGA 
AZATTAAGGA 
AGAIkTAATGT 
1C1ATCTTCC 
CTCCAACACA 
nrCTGACTA 
tftCtACAGTG 
VMACTTGTA 
fGAAAIUlGGA 
KAGGAGTTA 
OCAGTAAATA 
*C\OAAC TTA 
1TCTCTCACT 
t<aiCTTGTCC 
tCTACAGTCA 
CftTetACAGA 
WWCATTTA 
&GCGGGTCTT 
tKCTGAGCA 
OCCUCAGA A 
CCCAAfGAAG 
gP^CACCACC 



ATATTGAAGA 
ATGCTTATGA 
AAAAGTGGGA 
GCACAAACAA 
AATTATATAT 
AAATCATTTT 
TGACAIAAAG 
AGAAATTATC 
AGGATCA06G 
GAAATAACGA 
CCA AGgCCA T 
GGCTTCTGCC 
ACCTGAAGGG 
ACGGGCCA6G 
TTTCCTGAGA 
TTGTAAAGTG 
ACCAGACTCC 
GAACACATTA 
GGCATTTTAT 
GTGAGACAGG 
ACATAAGGAG 
GCAGTCCTGA 
AGTAGAACTG 
CTAAGAGAAC 
TCATTTATAA 

TAGG7GA6TA 
CCTCG CACTA 
TGCCCCGAGA 
AAAAGCTGAG 
GAGCAAAGCG 
CATAAACTCA 
TTAAAGAGCT 
CATCCTGCCA 
AAGGTCACAG 
GCTCTCCTTC 
CTCTTTTArC 
AACTACAAAG 
GCCACCAATT 
GAGAGGTCGG 
GGATGAAG6C 



6TCTAACACC 
TGCATTTATG 
AAACAAAGAA 
TGGCCTTATT 
CTTCAATTTA 
GTCTCTATGT 
AGTACGAAGT 
CCATTTG6AC 
CTGCCAACCC 
TAAAAAAAAA 
TAGATATATC 
CCCTTGAAGA 
AAG TTAAGTA 
GGAATTTTTC 
TCTTTCCATT 
TATGCATCAA 
CTCTTTGATC 
AATGGGGAAA 
GCAAASAAGG 
TAAACTAGGC 
TACCACrCAT 
GACTATTCCA 
CTA TCACTTA 
TGGCTCTTTA 
AAATAAGAT6 
TACATATACA 
TTTOGAACTC 
GTGGTTGTAT 
AGTGAGGACC 
GCTGGAGCCY 
GGGAACCAGA 
AAATTATTCT 
AAAGAGAATT 
ACACCTGTGA 
GTCTAGACGA 
TCAGTTCCAC 
TTTCATTATT 
GACCATCTGT 
TTCCATGGAT 
TGGAOGCTGT 
AGAT CC CTCC 
AQSS£^CA 



CACCACGCAC 
TTTAATAAAC 
ATAGCAGAAT 
TACACAAAAA 
AAGTTTTAGT 
CAAACATCTT 
TAGCTAATGC 
AAACAGCATG 
TAAGCACCCC 
TACA6AGGTT 
TCATTACTCT 
CTTCACATGC 
TACGATCCCC 
TAAGGATAGA 
TTCATGCTCC 
AGATCTCCTT 
TAAAGCAGCA 
CAATTGATAG 
ATCACTCGTG 
ACAGCTGCTC 
GGCTCATTTA 
ATTAtCTGCQ 
ACCATC TCGG 
CTCTGACTGC 
ACAGtGCCTA 
TTAGCTATTA 
ATTGTTCTGT 
GCTCA6TAAA 
TCTTAAATAT 
CCGACAACCC 
ATACACTGAT 
TTAOGTCATT 
GAGAAATTTA 
CATAGATAtr 
GGCAGATACT 
AGCCACAGAT 
CCCTAAACAA 
TTAGGTTATA 
ACTGAGGAAC 
CACCCATCAA 
TCCAATTTAC 
TCGTCCCTCA 



CTTGCTAAGT 
ACTCITCAGT 
ATACTC AAAA 
CTCT6ATTTT 

GGA6TTTCAT 
AACATATATC 
TTCTCACAGG 
CAGAAAGCTG 
AAACTACCAA 
GACATCTGGG 
TGGGGAAAGG 
ACATAATACT 
CA CTATT AAT 
TGCCTTTACG 
CATTCAGACA 
TGACGCCAGG 
ACAGAGAATA 
CAAAGACTCA 
ATCAGATAAT 
GGATGCCCAG 
GGAGACATAA 
TAT6CAAGGG 
CA6AGTCTGA 
TCrfcCTGGCA 
CTCGCTACAT 
AAACAGTA GT 
TTTTCTTGAC 
TCTGCCCTAC 
TGCACATTTA 
TATTATAAAA 
CCTGATTGTG 
AAACAGAACA 
CtTATTATAC 
TGATTCAAAC 
TCAATTAACT 
tACAGCATAA 
TGCAAATACT 
GACTCTAAAT 
TATCTCCTCC 
GCGCTACGAT 
CACCTAtAAT 



ACTCGTTCTT 
TCATGATIW 

AAGATATATC 
ACATTAACCT 
TATTTGCCGA 
ACTTTGTTTT 
AAGCATTTAT 
ACTGGCCCTC 
AGTGAGTAAA 
ACGACAAAGC 
ATAGATAAGA 
AACGAGAAAC 
GCAGCACTCT 
TTATTATTGC 
TTACTGAGTT 
TCAGAGGnr 
TTTTCATCTG 
GCTTTGCAAC 
GAAGTCATtA 
TCTGGCAACA 
AGGCTAGAAC 
GAAATGGCTA 
ATCCTG AATG 
CTTVTGTGAG 
TAtGATATAC 
TACAGCACGT 

acg ctcaggg 
atgcctt6tt 

ACGGCCATGG 

ACACtrrccT 

ACAATAGTCA 
AAGTAGCGCC 
TAC TCTTAAA 
CTATTTCTG7 
CCAGATCAAA 
CAACTATTTA 
AAGCCATTTT 
GGATGCAGGC 
CATCCATCTC 
TGCAGGCTCC 
CCCAGCACTT 



TGTTAGCTAG 
CTCATGTAAT 
CATTAnrtT 
ACATTTCAA6 
nCATTAOCA 
AACACAATAC 
9TCACAACTA 
CACACTTACT 
TCCTTCCCAC 
GTCAAGGGAT 
ATCCAAACCC 
ATAAGGATGA 
GCAATGTGAT 
CTTCTCCTAT 
AACACTTTGG 
ACAACTACCG 
CAATATATGT 
GGTCTTAAAA 
AACGTCAGGT 
ACCTAAGGAG 
COCTAATGAA 
CT OAACACTA 
pAGTCnOOG 
TTTTACTATO 
CATGAAGTGA 
AGTTAGGGAG 
GTGTCCGCAT 
CCGQACTAGA 
TGCTTCACTC 
AGAGGAACAG 
AAAAACGGAG 
TrrCATTATA 
AACATGAAAG 
CCTAATTTAC 
CTGATCTGAT 
AATATTCAAG 
CATAGCATTT 
TATATCCGAG 
ATGGATGCTA 
CATTTTCGTG 
CCACACCAT7 
TCACACGCCC 



CTTTtpTTCT 
TCCTGTTAGA 
GTTTGGACTT 
GTTTCAGAAG 
AGCTGXTAGT 
TCACrrCAGT 
CACTCACTTT 
TGTCAACTTT 
TCCAGACAtG 
AAAGAAAATT 
TrrCTTCTAt 
ACCTGOCTTT 
TCTGCAGCCA 
TAM^TCTCA 
AATTTTGAAA 
TGCCAGGTAC 
GATAAAATGT 
GATGAGTAGG 
ATTTCAOGAG 
ATTCGACAAT 
AfGATAGTCG 

GAcrmtcr 

TTACCTTGCA 
GATAAT66AT 
TTGGAAAGAT 
CTGGAGAGTG 
CTGTGGTAAG 
TTGTCCCAGC 
GACTTAATCA 
AGATCAACAA 
TAAATGTGAT 
GAGTAGGGAG 
ATTTGACAAA 
TCTAAAGTCT 
AAAAAAATGG 
ACATTGTATT 
ACTtGAGCAT 
tTTAGGAGTG 
ATTTTTITCC 
GCAAACGAGT 
ACCCACGCAC 



ICITCATTT'I 
AAACATTCAT 
AOCACtTTGA 
TATGTAATGA 
TATCACCAAA 
TCCCTAGGGG 
ATGTATTTCC 
CTAGAATCAA 
ATCTCACCTG 
TGTTGGAAAA 
ATAAGT6GTG 
TGGAOCCTOG 
TTGTA^OVG 
TTGGCTPCTA 
TTAAAACAGT 
TGTGTCAGCG 

CGTZCTCXCr 



TAAAACT AAA 
CTCCTTCGTT 



COTCTGTCTCl 



CCCA OC COCC 
AAA A GA C GGG 
TT6ATGTGAT 
AAOCCCTTAC 
GCAAAGTtAC 
AATCTGAAAT 
ACCATGGATA 
CAAGGCCTGT 
CCTAGCAACT 
ATTCAACTGC 
TTAGAAA1TA 
TAAGGAAGGA 
AAAAAC AGAT 
GTTAAGCSTC 
CTTTTTTCAC 
ATGGTfCCAT 
AGCTATTAAG 
CCACACATCT 
TCCAGGGCCA 
TTCACACTCC 
GAACGAAXrr 

Xtcacttcaq 



AGAAAAGCAA* 
CAGA0GAAG6 
ATCTAGTAGC 
TGAAATCAGA 
C7CACAAAGC 
AGATGATGAA 
GAAATAATGA 
AGATAATAAG 
AAAGGACITT 
TCTGTAAAAC 
TACTACGGGT 
CTAAGCGA1C 
CRXAAATGT 
«TCCA9A«Ce 
ATGCACCTAT' 

AGTTCAAATG 
TATGGACCAC 
CCtCTATGAA 
TGAQCACACf 
TCAGGAGACG 
TTCTGATACC 
AGOGCATAAA 
ATATATAAGG 
TGiQATGGAtA 
TTTACACCAG 
AAAGAGCTAC 
AACCAGCATA 
AGAGCTTGTG 
TCAACCACAC 
CTCTACTGAA 
AGAAACCTAG 

TGATATrrac 

AGTAAATGA6 
TCCGCTCTCT 
GAGAAATATG 
ffl;ACCACTy 



AgTTCCCACG CTGACGCAOG AGAATAGCTT GAACCTOGCA CATGGAGCTy CCAGTGACCT CAGATCGCAC CACTCCACTC CAGCCTCCQC CACACACCAA 



toCTCTCyCA AAAAAAAA AG AGTTGAAGAG 
GCA CTACAAC ACTCCATAAA 
TTCCTTWTG GAGTTACTCA 
CCCrraAAGA AAGGAACAAC 
CATCTGTGTG TATACAGCCA 
ACTTACCTAA ACCTAAATGG 
GACTGTACTG AACACTGTAG 
CTGTCGTATA ATOTGCTCCA 
TGTGTGTATA TATGTATACA 
PT^CACACA TATATCTATA TATATCTAC^ 



KATGICAAG 
TTCCmCTC 
UTtCCTAGA 
CiCACAAACA 
UtACAGTGT 
tKCAGCATGt 
UIGCGACCA 
ttTATATATC 



AAAAACTCTA 
GTGAACAGCt 
GCAAGTAG^C 
TCCAGGCAAC 
TGCAWCCTT 
TATAGCTTAC 
CCAATGCTAA 
TCATTGACCA 
TATATATCTG 
CACATATAT^T 



06ACAGA GG AGACGAGAGG AAGGAGCGAG GGAAGGACAA 



CCCTAAATTC 
GCAATGAAAA 
CCAAACACCC 
TTCTTGAGTC 
AACAATGCGG 
TACATACCTA 
CAGTGCTATT 
AAATCTCATT 
TACATATA9G 
25jATATAfA 
ATATGAT^ 



TAACGGAAGA 
CAGAAAGTTC 
CCTCTTATTA 
ATATATTCTC 
GGTTGTATTG 
TGTCTATCTA 
CTCCACCAAA 
TATACATATA 
TOTACACACA 
CATA6A0ACA 



AGTGAGCCCA 
AAGT7TAGTT 
ATCTTATAAG 
ATTTTATACA 
AGAAATCTGT 

atgtggocta 
aacXtagaaa 

TGATTATCTC 
TACACACACA 
TATATCTATA 
TCTATCCTCC 



AAACGAACTT 
CA TCTCC CTT 
CCCTTGCTCC 
TACACACACA 
CATTAA CCAA 
TTGCTCCTAG 
ACCTACACTG 
ATATATATAT 
TATATATGTA 
TATATATCTA 
AGAGTICAGG 



6CAGAGCAAG 
t^^ttccttt 
TCTTGAGATC 
TATATGTATA 
TTTCATCAtT 
GCTCCTGGGC 
AAAATACAGT 
ATAtATGATA 
CACACATATA 
CACACACACA 
AGTGTCTCTT 



GGAAAAGCAG 
CCTTTTTACr 
^ATCAGATA 
TACATGAAAA 
CTGCGAACAT 
TGCAAACCTG 
ATTATAACCT 
'ATATAT 

CATAGAGAGA 
CAGACTAOGT 



-1166 
*10S6 
'l»6i 
'1666 
1766 
^666 
1566 
-1466 
•1266
-$66
-366
-266 
-166 
32
Sll
CTGCCCAACA AGCTCAAATC CTGAAACCCC CCCTTTACTA AAAATACAAA AATTACCTCG CCATOGTGGC GOGTGCCTCT AAlCCCACCf 4S11



4611 
4711 
4811 
4911 
Sill
S211 
S311 
S411 
5S1J 
9611 
S711 
Figure 8B



JS^iSiiSS TAAAAIAAAC "aMCC TCCA . KnCtkohGh GATOCTTAAA VCACTGCIUkT TCCTATAACA CtTGCCAACC /UU^GGTOCTC TTGATCraJk 

iSSz^Hl 32S5i?T?i52 ?2S!S5H" I?"*??*' ctacwgacjLc acm:tggggt cacatttgca gctgcaccat aattaggctt ctgttcttca 

OCAGACATST GTTCAAAGTC ATRCGGCAA. CCATAnCTG AAAACACCCC AOCCAGOGTG ATGCATCACT TTGCAAAGAT CCTCAATGAG CTATtTTCAA 
C TgATCACA A AGTGTGAAGT TAACCGCTCA ATTTGAGTTC WlCill X it. ATCCAAACTA AATTCAAATA TGATTAGAAA TCTGACCTTT TATTACTCGA^ 
iFIwlEyS^' £55^AACTAA AAnGAATTT TAATTCCTAA ATCTCCATGT GTATACAGTA CTGTGCGAAC ATCACAGATT TTCGCtCCAT GCOCTAAAGA 
GAAATTGGCT TTCAGATTAT nCGAfTAAA AACAAAGACt nCTTAAiCAG. ATGTAAAATT TTCATGATGT 11 iCli U Ht CCTAAAACTA AAGAATTATf 

-17

:L_ir' ^ ' * iL BU Glu Asn Ala Asn Lys Ile Leu Asn Arg Pro Lys Arg Tyr Asn Ser Gly Lys Leu

: " CCC AAC AAA ATT CTC AAT CGG CCA AAG AGG TAT AAT TCA GCT AAA TIG : 

Glu Glu Phe Val Gln Gly Asn Leu Glu Arg Glu Cys Met Glu Glu Lys Cys Ser Phe Glu Glu Ala Arg Glu Val Phe Glu
GAA GAG TTT CTT CAA GGC AAC CTT ^^^AGA GAA TGT ATG GAA GAA AAG TGT AGT TTT GAA GAA CCA CGA GAA GT? TTT GAA . 

Asn Thr Glu Arg Thr

A^C ACT GAA AGA A^A^G TGSACTATTTC CA,Cf TAATAC CCTTCAGATC CAGAGCATAG AATA^AAAAT. CTTTJ^^ ACACTTCTCT TTAAAATTTT 
'«»^'<SWM:^*<»*'^AiA AOiTAOaUUl^^T^ TAATTTGTCT TCT iyiA TTC 

Thr Glu Phe Trp Lys Gln Tyr Val

.TrTATAG^CT GAA TTT TGG AA$? CAG TAT GTT G^ TAAGCAATTC AITTTATCCT CTAGCTAATA TATGAAACAT ATGAGAAITA TGTGOqTTTT 

TTCTCTGCAT AAATAGATAA TATATTAAAC TrTGTCAAAA GGACTCAGAA AGATCAGTCC AACCCTCTAA CCCATATTGG ATGGTGATAT ACTACAGGGT 
TATGCCAGTG TGGGAACTAT CGCTGGTAAA TAAGTTTAAT CCTCCCTAGG GCTTCACAAA GAACATTGTT CCACCCCAGG AGGGTGGAAG GAAGAAACTG 
AAATGATTGT GTCTTAGAAC CTAATGAAAG TTTGCATTCC TCACTAAAAT CAGAGACTGC TGATTGACTT AAATGTTTAT ACCTTCAAAG TCCTCCTCAT 
TAT CATGG CC CAGAAGCCCT TCCATGATTG TCCTTCCCCA CCCTCCCCAT' TACCCTTCTT GCCTCCTCTG CTACTTCTCT CCTCGCACAC TGGGCTCCAG 
CCAOCCTQCC CTTCCTGTCA CTTCTTGC^. ACTCTAGCAA TCCTCCCXCT TTGGAGGCTT TATCTGGCTG TTTCTCTTAT TTGGCTOTTC CCAACTTCCT 
GT^GCTGAC TCCCTCACCT ^TTCGGGTfe TTTGCCCAAA. TGTTACCATC TTAATGAOOC CTACCTTCAC CATCTATTAA TACTTCAACt ' TGCCCCAQrA' 
CCCTT ACCAC TCTAGACACC TGTACAGAAC TC CACyT AC TTTTTAACAG ACCTTTTCAC CATCTAATCT ATCATATAAT TTCTTACTCA TACTATTTAT 
GATTtATTTT CTCCTACTOC ACTAAAATGC AAGTTTCAT6 TTGGCAGGGA TATTCAATTG TTTTGTTTAT TGATATATTC CTAGCACCTA GAACAGTAtC 
TGGAAAAGAQ CTACT CAGTA AATATTTATC AAATGAATTG ACCAAAAGAA GCAAAACTCA AAACTTtAAT GACAACTAAC TTTAAAGCTA CAATAACTTA • 
AAAT TCAG AG TAGGATTTTG AGGGAGGGTA AGTTTCAAAG ATTGACTTAC CTAAGACTAT CTGCATAAAT AAAAAGAAAT TAATCCAGAC AACAAATTCA • 
CCAACTTCCA TCAATTGGAA ATCCAATTCT ATTTTCTACA GTTTATGTTC TGGAGACACT ACTGGACACT CTTTTACTCT CATAACTCAT AACTCCTCCA 
CTTTTGTTTT TAAATCATGA GAGAAAAAGA GTTGACTCTG TTATATTGTT TTATCTACCT TTCCTTGATC TTAGAAACGA ATACTACCAT ACCAGCTTCT 
ACTGAGGTGC CCCCTAAAGT TAGTCCAAAT AGGTCrTTGC AATCTCCATT CCCGCAGAAT TTAGAACTTT GAATCACATG ATTTATTTCT AAAAGTAAAT 
CCATGCCGAT TTTCCCCACC AAAAAATTCC TCACTATTAA ACTCCTACAA TCCCTTCATT GCTCACTCCC CACCCCCAGG ATCATATTTT AAACTTGGGC 
CCTTGCCTTT TGGGTCACAT AGGTACACTG TTTGCTATAC CACAGGTATA GCTATCTCGA AAACATGGAG GGTATTATTC TGTTACTACt GCTTCGTCAA 
. CCAAAAA ATA AAACAAAACA ACAACA AAA^ AGAAACAAAC TCCCTGCCTC TTTTCAtTTG CAGTCAAGGT TCCTAACCAC TACAAAATTA GCCTATCTTT 
CTTCTTGCAC ATAGTAGAAA CC CAAGC TTC TCACTGCTGT GCTATTCTCT ACCATCAACT CAltrXCATAA AGAGCCTOGT TGAAGAATGA TTGTCCAACC 
ACATTACTAG CATCTGTCAA GACTTTCCAG TTTACAAAAG GCCTATCACA TTTAACCCTC ACACCKtCCt TGTGACCAAA GCATTATTAA CTCCATTTTA 
CAGGAGAGTA A ACTG AAGCT TAGGGAAGTT AAAAGAACTG CCAAAGGTCT CCCAGTTGGG GAGTCATQAA GCCCAGAAGA GAAGCCAAAT TCTCTGCTGC 
TCAACCCCTT GCTTTCACTA TTACACCTCA GGGCCTTCAA ATCTAAATGC ACTTATTCAT TAAACAQGAA CCTCGTAGTC TTAAACAGGA ATCTCTCACT 
TG6TAAGATC TTGTCTCTTG TTGTATTTGA CCCCAACTGT CTATGGCTTT GCCTGAACCC AAAGTACACA CAGCCTAGAA ACCAAAGGAG AACCAAATGT 
GGGATAAAAt 6ACACTCATT TTAACGACAT OTCtCAGCAA ATGAGTTCCT GTGTAGCTGG CTGAAAGCCC AGACCCTTTC AGTAAAACAT CCTGAA:TAAT 
TCACATTTGT T6GTCTATAA TATAAAGGGC AAATGTAGCT. CATTTTTAGA CCAGTTCTGA ACATCAATAG TAACAAACCA GAGATAACCG ATTTTGTTTT 
CATAGAATTG GAACAAATTA GAGTATCTGT GCAAAAGCAT ATCAGATCTA GGAGCAGAGG GGACAAGGTC TAATTTTTAA ATAAGCAAAT TTTCCAGGGA 
GGGACTACTT ATGATAAAGG GATATTAGTC TCTTAGTCAA CGGAACCTCG ATACACGCTT CTCACAGAGA AGACCGAGAA TACGCAGGAA TCTACACACC 
AGATGTCAAG GAGATT TGCT TTAAAATACG ACTGATAAtT AGAAATTTCT CAGTTTCCCC CrmXXCl'C A T TCTTTGAT TCTTATTCTT ATtTTTATCT 
C TTACTC CW TGTTTCTCAT ATATTGAGTC TTA<:AGATCA AGCTCCCATT TTTTTCTTCA GGGGTATTTT TCTAGTTCAA ACTGCCTACC ATCTCCCTTC 
TOGTTCTATX CATCCTTCTC TCCCAAAGCT CCTTTAGAAG TGTGGATTAA GGCAGAGCAC TAAGAAACCA GACTTAAAGA TTCCCTTCTC ATTCTGACTT 
TTCTCCTTTC ACCTATTCCT TOCTCCTGTT TTCTTACCAT CAGTGTCTTC AAAGGCTTTC AAGTACACGG TAAATGCAGA AACTTCAAGA aaggcagaat 
GGAAACATAA CCAATGCATA CATAAATAAA GCACACTGTA GAATCTTTTT AAATTCTGTA TGATATATCG AATGCTGTCT CTCACATTAC CTAGACCATT 
TGAAACCCAA TTTGTAAAAC ATAGACTATC TTTAAGTAGT AACAGATGCT TCTGACATGT TTTCTATTGT CTTGAACCAT TACTGCATAT GATACATCAA 
AGTTAAGTGA CAATACAAGA AAGCAGATTC ATTTGCTCCC TGCCTAGGCC GTCAGTTCCT AAAGTGGAAA CGCCATATAT TAtCTAGCTC AGTTTGCTCT 
ACAAGACCTC CAATAGAGCC TTGTGTGACA TAGAGATAAT ATTTGTTGAA GCAATTAAAT TtqACTTGGA ATTAACTCTG CCATCATTCT ATAAGGAAGG 
ATTGAAAATC CTTCTCACCC TGTOCTGATA TAGTACCTTT CTATACAAAA ACGTCCTTCT CCCTCtTCCC TTGGATTGCA TAAACTATGT ACATGCCTTC 
CTCAGGGGCA C7TTTCTAGG ACACTGICAG CCTAAGGATC TTTGTTTGGG TGGCTXTTA6 AAACTCAGGA AGACAGGAGC ATCATATGCC tKtAGGCAGC 
TGQCTTCCAG G1CAGTAGTT TT CC TCTGAC CCTAAAATCA GACXCCCATC CCAATCAGTA TCTACAOGGG AGGACCCCGC ATTCTAACCA CTtTACGTCC 
47

Asp Gly Asp Gln Cys Glu Ser Asn Pro Cys Leu Asn Gly Gly Ser Cys Lys Asp Asp Ile

CAATTCAATT TCTTAACCTA TCTCAAAG AT GGA GAT CAG TGT GAG TCC AAT CCA TOT TTA AAT GGC GGC AGT TGC AAG GAT GAC ATT 
84

Asn Ser Tyr Glu Cys Trp Cys Pro Phe Gly Phe Glu Gly Lys Asn Cys Glu Leu

AAT TCC TAT GAA TGT TGG TGT CCC TTT GGA TTT GAA GGA AAG AAC TGT GAA TTA G^GT AAGTAACTAT TTTTTGAATA CTCATGGTTC 



sail 

«1U 
«31l 



can 

6474 

«$7| 
6C7B 

(743 

6963 
7041 
7143 
7243 
7343 
7443 
7563 
7443 
7743 
7843 
7943 
8043 
8143 
8243 
8343 
8443 
8S63 
8643 
6743 
8843 
8963 
9063 
9163 
9243 
9363 
946) 
9563 
9663 
9763 
9863 
996) 
1006) 
10163 
10263 
10363 



10450 



AAAGTTTCCC 
CTAGTCATAG 
TATT AATCTT 
ATAAATGGTT 
ATCATGACTA 
ACAAAGCACT 
ATCCTTGTGT 
ATTACCTCCA 
TGCAATCCTA 
ACAGATGAAA 
AGTACTAAAA 
AAAAGAG TGA 
TGTGTTAATT 
AAACCTCTAG 
T6AAGCGCAC 
6AATGTAAAT 
GATACTCTCA 
CAATCTGATC 
GCTACAGCAA 
GGCCCCAAGT 
CACTGCCAGA 
ATCTC CCTAG 
TTCCCGCAGT 
CTACCTATTT 
QCTTTGGTTG 

aagtgtttgt 
c catatccac 
attocttotc 

CAAGOCAAAG 
ACAGAATCAC 



TCTGAAACAA 
TTTTCAG2AT 
TAATACAACC 
TAGACAGGAC 
AATTGCTATC 
TCATATGCTA 
ACCTTTTCTT 
GATGCAATAT 
AGGAAGAATT 
AAGGCATGAA 
AATTCATTGA 
ATGAACTTGC 
GATTCAATAT 
AAATCCTTGG 
ATTGTTGTAA 
ATAAGCACAA 
TGTTCCCACA 
AAGGTTTAGA 
AAGCATAAAC 

ciiriGicri 

GAGAGGTGTA 
CTCTACATCA 
AAAGTAGAAT 
CTGAGGAGTA 
TCACACAAGT 
GTTAACATGG 
CACA6TACCT 
CCATTATTTT 
ATCAACACCA 
AATTGAGAAA 



GTTGAAACTG 
ATGAGTTCAA 
GTATGTGGTT 
TTAAACTTCA 
AACATAGGTG 
AACTTGTTGA 
CTCAAACCCT 
TACTGGTACT 
TAACACCACA 
ATGAAATGAA 
CCATCTTCCT 
ACACCTGGCT 
CTCTGAAGTG 
AGTAGTTACT 
ACTATAATTG 
AACTT CTA CA 

AAAATATTTT 
AAATTACCAT 
ATAAGGTCTT 
AACGTTCAGC 
AACCCAAATA 
GGAAAGAAAA 
AAAAAAGGAA 
CGATATATCA 
AGGGAGGGAG 
CACACATAAA 
ATTTCTGGAA 
GCAGCAACAT 
ACACAGAGTT 



GAAAATGCAA 
GAAGCTACAT 
ACTACT ATCA 
GTGTGACCAA 
AAAGTCAATT 
CAACATTCAA 
AGATTATTTC 
GTCATTGAGA 
GGCTACATCA 
ATGTAGCAGC 
TTAGTCTCGA 
TGG ACTCC CC 
CTACTTTCAT 
CATTATCAGC 
AAAACATAAA 
ACAGAGTTTG 
GTCAAGGCTC 
CAGC CCAC TT 
TTAAGTTTGT 
GAAAAAAAGA 
GCATGCATTC 
TCTCTCAGAT 
CAAAAATCAC 
ATATGTTGAG 
GGAAATATAA 
AACAGATGCT 
TGCTCAGTAA 
TCTT CA CCCT 
ACTGAGCCCT 
TATTCATTCC 



TATTGCTGTA 
TAAAATCAAT 
TTATGCGCAT 
AACCCATGCT 
AAGTACAGAA 
AGCTCATCCC 
TTTTTCCGAC 
AACGCACACA 
CTAACGATAA 
TACACTCGTC 
AAATTTCTTA 
ATTCCCCTTA 
CTGAAAGGTT 
T TAAATAA TA 
TATCTTTACC 
CTACGTGTGT 
TCCACTTCTA 
ACCTCACAAA 
TGCTTCAAAT 
AAGGAGATTT 
TAGTTAAACA 
AAATGAAGGT 
AAGCCTATAA 
TITAGCTCAA 
AGGCAGAATA 
TTGAGATGTT 
TTCATAAAtO 
TTTAOCTGAG 
AAAOGGCAAT 
CATTGTGCAT 



TCATAATTTT 
AACAATATTT 
TCTATGCAGA 
TCTAACTACT 
CTGGACTATG 
TGAATT TCAC 
GTtTTCAGTA 
GTGCTAATGA 
AGAGTACAAA 
CTATTGAGAA 
GAAGGTAAAA 
OGTTTCCATT 
ATAATTTGAA 
TAOCCGGTGG 
TAGTTTAAAA 

CTGCACGCCA 
CAAAATGAGA 
TAAAGACTAA 
TCATCAATAA 
ATA1TAATTC 
CTGTAAGAAT 
ACACCTTCTT 
GCACAGCATA 
AACTAAAGCA 
CTTCAACAGA 
AGTCCCATTC 
CGCAAAAGAT 
CACAAKIGtG 
GGCCTGGACA 



TCTTAAAAAC 
GGTAACTAAT 
TGAGAAAACC 
ATATTCAAAA 
ACTGGCCAAT 
TATATTGATT 
AiTGGAGCAG 
GGAATTCACT 
ATTAGCAGGA 
AGGAAAAAAG 
AAAGGAAAAG 
GTGGGGGACA 
ATTCAGATTT 
AGCTGAG06A 
AATAAA6ATG 
CACCCAGCAA 
CCACCACTCC 
ATTCCCACAA 
TTGCAACACC 
CAGTTTTTTG 
GCTATTCGCT 
TTCCTCATTC 
CAATTTTCCC 
TTTTGTGGTA 
GAACATACTA 
TATTCTAGGC 
TAACTGTTCC 
TGCtGATTAG 
GA6AATGATA 
AACCAACC9G 



ATACCTTTGA 
ATTAAGTAAt 
GCAACtCCAA 
CTCAGAGAAA 
TATCCCATAT 
ACATCGAAAA 
TAAACCCCAG 
TTCTACTCTG 
CTCTATTAAC 
TCACCTGTAA 
GTGACAGGGC 
AACTAATGCC 
ACCTGGATAA 
ATGAGTAACT 
CTTTAAAAGG 
AAATGCTAAG 
TGCCCTCTAC 
ATTCCTCTTT 
TACTAGATAG 
TTAtCTTTTT 
CCCTTTGGTT 
CTGTCTCTTC 
ACCATCTCAC 
AACTTGTTAA 
ACATTTGTAG 
ACTGAGACCC 
TTACCCCTGC 
GAAGCAATAT 
CA CACCTC TC 
CACCTTTCCT 



TGCTTATAAA 

AATCATGTTC 

CGOCCAAAAA 

ACTCAACCCA 

AATGOGAATt 

TCTT ACATAG 

TGTCCCTTAC 

ACACTCTGGA 

CGATTACAGC 

TCTTCTTCAG 

AAAGACATTT 

TGGCTTACCT 

ATTTGATCTT 

CAATTAGtCT 

AGGAAGCGAA 

TCTACAAC7G 

AGTATATTGA 

ATCTCAGACA 

CACAGTTTAT 

CCCTTGTTCA 

AGAATAAAGA 

TAAAGAGtAA 

AGiACACTACT 

ATAAAACATC 

TAGGCATCAA 

CCTTCGCGAC 

TCTATG6AAC 

TIGCCACCTC 

GTTACTTCTT 

AACTTATCAC 



CATTTCATTT 
CGACTCACCt 
TTACAGA6GC . 
GAAAATTGJIA 
CTCCACATGT 
CAACCTTAGA 
CTACTTGTTT 
AGAATAGAGA 
AATCC ACCTC 
AAATCCTTTC 
GAAAAGAAAG 
TTCTTGAGAG 
GCTATTATGG 
CAGtTACAAC 
TAGCOCTGAG 
ACACAACTTG 
TCTGACCCAC 
ACAGAGGAAA 
GCGGCATCTC 
TCAGGCCCTT 
CCTCTGTATC 
CAGACG CATT 
GTCTTATTTA 
TTTTGTCCAA 
GGGAATTAGfL. 
CAiSgC ACACC 
TCTCCCCT6A 
CTGCGCAAAA 
AOCCAATGAC 
AATCyCATAT 
11437
11537 
11637 

117) 7 

118) 1 

119) 1 

120) 1 
12131 
12231 
12331 
12431 
12531 
12637 
12731 

128) 1 

129) 1 
1303? 
1313' 
1329; 
1S331 

134) -' 

135) ' 



WO 00/75279



PCT/US00/15728



10/35 
Figure 8C 



fOlCGGMCA CTTTCTACA6 OTAAT6TTTG ATTrGGCTCJl ACACTtTAGC ATICCTTGOT AGCAACMUUi TCATAGCTAC TAACACAAAA AGAfCCAGOa 
mTTACCAC TCTTAGTGAG OACAAAOGCC TTTTAA1TAA TTAArTAATT AATTAAtAGG ACCAA6T0CC AT CAIAilW Q ATCATOCCCT TAGICttTTA 
fTOGTAGCAA AOGTTAAAOC 9CAAGCTGGT TCCmCTCC CCCTOOCAAC AGTTGArm CCVCCCTTAT CTCCTCAAGT ACCCTAAGGA CTAAGAGcS 
A mtTACAT tTCOC TATOC SA6CATATGT AAAATAQAGT TZAAAACm ACAmATCA CTCAAAAATT CATATTCTCC AAAACCATAC ACTCAcfSc 
TVACCCfCTG nCOCCCAfiA AAAAAAG9CA CAAGCTTAtT AtTAACATGT CCAATCCAGC CCCAACA C AA AGCAACT6AA GATGAGOCAC AAACGjSiuJfi 
JLAACCCAATA AGAGOATCAC TTATCAAACT ACXCGTTrCT TAACAGCAAC TGATTGCTTA ACITCCTCCG ACTGTCTCCA ATAAGTCAAA TTGOCCKAC 
CffACTCCAC CTGAC90GGA AGAAGCGGTG AAAGAAYTTG TCTCTCAGTA TCTGT C TCT C ATTCGTTAGA AGTTCGACTT ATGOGGAATT AACTCCCTCA 
CATTTCCTA6 TTGGA7ACCT TGGGTACCAC AGGCATATGC CATCCATGCT CAGCATGAAC AGGGAAGCTT CAAGGCAAAA CACACATACT GCA6CTATCA 
CCCAACGCAA tTCAACCATA CACCCA9AGG ACCCTGGTTG ACATCCACCC AGAGCTAATC ACCAOCATGC TGGAAAAAGA CACAGGTOAA GCTGAGAACA 
mMOGTGC rCCATACGAG 6TA1CTAATA CAGTCACTCA TT7TCAAACT TTCCATCTTA TCAnCCACT GACCACTCAC CATTTCTATT GAAAGTTTTA 
CTCrrCTCAA ACACCTACAC AAGGGGAAAC GTGTCrrACA TTGTTTATGT TCC T GTGCTG CTCTAGAAAC AGAAATAiSGC TCAAGAGCAG AGCCTGTTn 
TCTTAATXCA GCAGGTCTAA OCTAACAAGT CCTGAAACAT CGTACTTCCT GTYATTGGTA TTGCATACGA GAAACAAAOC GAAAGCACAG TAATTA6XAA 
ATACAAACAA QATGGCAGCA AT AAGCC AAA AATATCAGGA AACACAATTA TTGTGAATTG GGATTAAACT AATCTATTAA TAATGACAAC TtXCAGCTTG 
OCTTAAAAA TTTAATTGTA TACTGTTAAC GAAASTGATA CCTAAAATAA AATTACACTC GGAGGCCAAA ATGAACCCAT CICAAAACAA CTATCA6GTA 
/MAACTAACA AAAA O AAACT A C CAAAGCAA TCTtAATATC AGACAAAATA GAATCCAAGA GCAAAATCAT nCAAAAGAC AAGAGATTTT TTITATTAAT 
MCC6GAATT GCATA^GAGA GTAAAGAAAA 9GTGQGCCAC tCGAATCCTT ACCACTAATC ACATATTGGT CTPTCCTCTT CACTTACCTT ACACGACCCT 
J CTTCAT TCT CTTA TGTW C ATATOTAACC ACCTCAGCCA GCTrCAAGTT G CAAAIAG CC CCTAATGGAC tTCCTAGCAC TATAATTTCT TlTTTmAA 
ATCTTTTATT TTAGGTrTAG GGGTACAfCT GAAGGTTTCT TACATAGATA AACATGTGTC ACAGGCCTIT GTTCTACATA TTATTACATC ACGCAGATAT 
TCAOCTCAGT ACCAAATAGT GATCtmCT CCTCCTCTOC CTCATCCCAC CCTCCTCCCT CAAGTAGACT CCAGTATCTG TIGTTTCCTT CTTTCTCWT 
AmCTTCTT AACACTTACC TCCCGCTT A C AAGTCACAAC CTGCAGTATT TCATTTTTGT TCCTACGCTA GTTTCCrAAG CATGATAGCC TCCAGCTCCA 
ffCATATTCC CACAAAAGAC ATAATCtCCT TCTTTI C TAT GGCTGCATAA TATTCCATCC TATATATGAA CCACATXTTC TTTATCCACT CTCTCATTCA 
tCCCCArtTA CCTTCATTCC ATCTCtCCTA TtCTAXCACT CTAATTTCTA AACACITCCA GATTCtACTT ITATAGGTAA CCTGTTAAAC AGTCTAGCTC 
TGQAACCCAA GCAATTTCTA GAATAACTAA GCAATAGAAA TTACACTTCA AtGCAGAAAG 6CAGTATCTA CATGAGA1TA TGAAATTGC6 CTTG C a iXIJ 
CtGITCACTG AAAAAAATAA GTAAAACTGT AACTTTCAGA AAAAATGATT GTACATATAG AAAACCCAAA GCATCTAAAC AATTAAAATA AATAAGTATA 
CAAAGATTAC TOGATACAGA GTCAACATAC AAATATCAAT 7GTATGTCTA TATACCAGCA ACGATXCAAA AATGAT1TTT ATAATAGCAT TAAAAATTAC 
ACBCTTAGTA ATAAAtqTCA CAAAGAVCtG CAACAACTCT ACATAAAAAA VTATGAGAC6 TTATTCACAA AAATTAAGGA AAACCTAAAT AAATGAATGA 
AtAGCCAATG TtTATCATTA AAG6ATACAA TATACTAAAT ATATCAAATG TTTACTAATG GATTCAATGC AATACCAAAG TCCCAGCACC CTTTTTTGC? 
OCtOCCAGGT CGGGCAGGAT 9CATAAGCTA AITATAAAAT CCATATGGAA ATGCAAAGAG CCAAGGATAG CCAAGACAGT TTTGAGGAAG AATAAACm 
XACTACTTAC ACTACCAGAT GTCAAGACTT AtTATCGAGT TACATTtATt AAGACAGTGt GGTACTGACA CAAGGATAGA CAAATAGATC AGTCAAACAC 
ftCTACAGTGC TCAGAAGCAC ACCTGtACAT ATATAAAGCC TTGATTTATG ATAGAGCTCC CAGTGCAGTA GAGAAGGAAA TTATTGCTGT TITCAATAAA 
WCKATAOG TCAATTAGAT ATTCATATGG CATGAAGTAT GAAACAATAA CAAITTATAT TCATAACTTC CAGAAACCAA AAATTTCTTA AAATACAAAA 
KCTGATCACC ATAXAGGAAA AGATTGATAA ACTGGACTAT ATTAAAACTA AGGACTCCTG TTCAGCAAAA GACACTACrr CGACTGAAAA GACAAGTCAC 
ACAGTGAGAC AAGATA7CTC CAATACAGAT ACCTAATAAC TGAACCCCAT ACAGTGATGG TGGGAATTTA ACttCGTACA AlCATrTTAG AAAATTGCTT 
OOCAGTATCT ACTA^TCTG AACATGTGAT CCAGTAATTA CACTCATAAt TATAAGGCAG 9AAAAAGOCA T G tlT A TCTC ACOUIAAGAT ATAtACAAGA 
JOGITCATTA CACTATTATA CATAAGAGCC AAAAACTGGA AACAAACCAA ATATCCATTA ACAGTACAAT GAATAAATAA AA6CTGTAAT AGTAATACAG 
reCAATACTA CACAGCAATG TAAATGAACT ACTCCTCT A C AAAACAACAT GGTnAATCT CACAGACAAA ATGTTAAA1C AAAGACACAC ACGAGTACAT 
mCCCAACT TCTGTTTATA ATTCAAGAAC VGGCAAGAAC TCnTACTGT GTTXGAAGVC CAGGTAATGG TAACCTATAA AAAGGAAAAA OGGTGGAATG 
mCGGAGGG CGCATC VrCT GGGGTC9TGA fAATGTGCtA TCTATTGGt C AGTTTAGtGt TTAAACAOGC TCATTTACTT TGTGAAAACT TACACTAAAA 
TKICTGTAT WTTTGAATA TATCnATAC ATTAATAAAT AGGCTTTTTA AACCTCTACT TCATAATTTA GTGAAACTAG AATATCCAAA CATITAGTIT 
TAAACCAATC AATTATAGTG CTACCATCAT TTTTATCCAT TATTCAGAAG TTTATTTTAC C TTT CT TTC C ACrdTATTT CAAGGCTCCA AAATTTCTCT 
CCCCXACGTA TATTCCCCCC AACATGAATC CCCCCAATGT ATATTTCACC CATACATGAG TCACTACtTC CATCTACTTT TTAGAAATGC ATGTTAAATG 

85

Asp Val Thr Cys Asn Ile Lys Asn Gly Arg Cys Glu Gln Phe Cys Lys Asn Ser Ala

ASGCTGTTAC TGTCTATTTT CCTlCmTA G. AT CTA ACA TCT AAC ATT AAG AAT OGC AGA TCC GAG CAG TTT TGT AAA AAT AGT CCT 

127
Asp Asn Lys Val Val Cys Ser Cys Thr Glu Gly Tyr Arg Leu Ala Glu Asn Gln Lys Ser Cys Glu Pro Ala
CAT AAC AAG GTG GTT TGC TCC TGT ACT GAG GGA TAT CGA CTT GCA GAA AAC CAG AAG TCC TGT GAA CCA GCA G^GTCATAATCT 

6MTAAGATT TTTTAAAGAA AATCTGTATC TGAAACTTCA GCATTTTAAC AAACCTACAT AATTTTAATT CCTACTTGAA TCTCCTTCCT TTTCAAATCA 
T&CAAAATAT CACT ACCTTC AATTAGACCA ATTAATTTTC TAGATTCCAT CATATTTTAA ATATAACTAT CTAATCATCT ACAACCTGAA TTClTT C TCr 
eiCCAATTTG TCCAATTTTT TTCTCTAACA TTTATATCAC AAACCAATTA ATTTGTGTGA TTTCTCCAtA TCTATTTGTA ATTCATCAAG TCAAATCAAT 
CTAGTAATAC TATATCATAA AATA TACACA AATAATTCAG TGATAGGCTT CTAGTATAAC GACGGTAACt TTGAAGCATG ATTCTATCTC CGCTGGCTAG 
TTTAC TCTGA GAAAGTTATT TTTTATTGTT GCCT C TTAAG CTGAG7TTAC ACACTT GC T C TCAGAAtGAT TCCGGCAATG AACTGTTTTA TCTTCTGCTA 
GGCTGATCAG CACAATCTAT ATGGCTGTGA ACAAAACAAT GTTTCCCACT CATACCAACC ATGCCACCAT TTTAACACCT GATTACTCTA TTCAGAACAT 
CTOCACrCCA T6TTCGTATG GCTGTfATCT AAAGATCAAA GCACTAGACA CTTTTATTTt TTGAAAAATT TAGGCTCTGC AGGGTCAATT ATATTTCATA 
UTCAC CQCC TTTTTTGAAG CAAACTAGAT ATAATTTCTT TTCCATPICT AAAGCCTCAT ATCTTATTAA TTCGTACATT AAATTGTGCA CCATTTCTCT 
CrMCTGTTT CAGTACCTGT CTCAGCACTA TACCAGGCAG AAGAAATAAA G AAAGAACCA GTGCCAGATC AGCTTGGTCA GGAGACCCTA ATCCTGCGGC 
AC T ACA C CAA TTAAAGACAC ACACACAGAA ATATAGACTA TGGAGTG6GA AATCA6GCGT CTCACAGCCT TCAGAGCTGA GAGCCCCGAA CAGAGATTTA 
OCCACATATT TA1T6ACAGC AAGCCAGTCA TAAGATTTAC TGAAAGTATT CCttATGGGA AATAAAGGGA TGAGTCTGGC TAGTTATCt6 CAGCAGGAAC 
Wg'CCT TAA CGC A CAAATC ACTTATGCAA TTGTCTGT^ TTTAAGAACA CCTTTAAGCA GTTTTCCGCC CTGGGTGGGC CAGGTGTTCC TTGCCCTCAT 
tCTQCTAAAC CCACAACCTT CCAGTGTGGA TATCAACGCC ATCACGAGCA TATCACAGTG CTGCAGAGAT TTTGTTTATG CCCACTTTTC GGGCCAGTTT 
UCeCCACAT TTGGAGGCCT CTTCCCAACA AACCAGAAGC TAGGAATATA TATCCTGCAA ATAAAATGAA GAATCTCTAA GGCTTCAGGO CCTGCCCACT 
WnCTTCTG CCTGCTTCTT CACATACACT GTCTCAAAGC TACTCTACCT TGAGAGGAGC ATGAATATGT CTCTGOGTCT GTCTCTCTCT ATTTTAACCT 
nUAACCTA ACTTCCAGTA TACACAGATG GCATACTA6C TAAACCCTTA CAAGTTCTTC TATGCTATAA AAGAGAAACA GAATTGAGAA CCACCTCCAA 
gATTA ACTG TTATATTTGA ATATACCCTT AGCTTTAGCA GAATAACTAC GCCAAACTTA AAATAAGCTT TTCTGCCTTT TCAATGATAA ACGTCCCTTT 
TCKTAGCCA TTGTTGATTG TCTACACTTA TACATAAGTA TTTTGAACTA ATTTCCTCTT TTCTCAACCA CITOCTGTCT TCATGATACT TTGTCGCAGC 
mrcCTAT ACAAATCTCT CTTACAAGGA ATGTGGCTTG AAGGAAAGTG ATAAATGAAA ATCAAATCTG AAGTGACTTT CTTTCACT^C AAATTCCCAT 

ktggtagtc CCCAGTGTAT CAATACATTA TTTTTCTTTA GAAAATAAAC caacccaagg aaaaatgctg gccaggtcct ggtgaatatg gctgtgataa 

tt ATATTA GC AATCTCTRG OCTAATATIT CAAGCCCAAA TAATTGAATC ACAATGATCT CTCCCCAGAA AATATATAAA ATGCACCTTG GAATCTAGAA 
C6KmTAG TCTGCAAAAG AAACCTTCTT AATCATAAGC AGCAGAAGTC CCATTTACCA AATTGGAAAG TTAAAGTTAC AAAGCATCAA TCATCAGACt 
KCATTCAGG GATGGCAATT GCGAGTAAGA CTTTTTAGTA AAGAAACTAA ACACAAAGTC ATTAGACTCT GTAAAAGTCT TACCAAATTT 6ATTCT06AA 
9^0»^ TTCT ATTTCCGTAA AGATGATGAA TTCGGAGCCA AAtGTTCTTT TCATGAAGGA TTTGAAAACt GTCCATGAAA ATAACOCAAT CAACCTTTTA 
CCnCAGACT CTATTCACTG ATTAGATTIT TTTAAATACT GATGGGCCTG CITCTCAGAA GTGACAAGGA TCGGCCTCAA TCTCAATTTT TGTAATACAT 
128

Val Pro Phe Pro Cys Gly Arg
TCTAG JKS CCA TTT CCA TGT GGA AGA 



Val Ser Val Ser Gln

GTT TCT GTT TCA CAA 



CncCATTTG CCAAT6AGAA ATATCAQSTT ACTAATTTTT CTTCTATTTT 

!^ !i£ Val Phe Pro Asp Val Asp Tyr Val Asn Ser Thr Glu Ala Glu Thr Ile Leu Asp

Ut TCT AAG CTC AOC CCT CCT GAG OCT CTT TTT CCT GAT GTG GAC TAT GTA AAT. TCT ACT GAA GCT GAA ACC ATT TT6 GAT 

Asn Ile Thr Gln Ser Thr Gln Ser Phe Asn Asp Phe Thr Arg Val Val Gly Gly Glu Asp Ala Lys Pro Gly Gln Phe Pro

Iff ^ ^ ^ ^'^ 

Trp Gln

KC CAG CT ACTTSATACT GATCGTGTGr CAAAACTGGA GCTCACCTGG CAAGACACAC GCCAGGTGGG ACACTCAGGC TATTTTACTA CACAGA CC TA 



neCGATGTG 
«ICAGCTC 
iOCACCTTC 



ttlCAGAATC 
f mCA A CC A 



ACAAGIATTT 
AAGAAGAATT 
AOCCCACTAC 
CTCAAGGA06 
AOCTCGGAAC 
CTGCAOGTGA 



ACGCAA C TTT 
TGGCATTAAG 
TTCAGCTTCA 
AACCATTAAG 
CTAGAAATGC 
TTTTGATGCA 



TGCTTAAACC CAGCACTTTC 



CAGCACTAAC CAATGTGAGA ACGCCTCCAG AGATGAGCAG TTGGTGAAAG AGAGGCTCAA AACCAOCTAC 
GAAACAGCAT ACCAGGATTC CAGACAGCCA ACTGGTCAAC AACATGAAGG TCTGGAAGAA AGGTCGCAGT 
CCCCTTCCAA AAACTGGTGA GAGTTCGAAA GTCTTTAGGC TAAGAAAAAT TCGATTATTT AAAA G GCCTA 
GCAAGAACTA GGTTCCAAGA AACAOGGCAT GACAGAGAGT CTTGATCTAC CACTATAGTT CTCGTGGTAG 
AAATTCTCCT CCTCTACACT AGACCTACCA AATCAGAATA TCTACOCGGT CGGCCCCAGC A C TC T CTCCG 
CATTATACTX TCAAAACT AO QCCAQCTOCA CTCGCTCATG CCAATAATCC CACCACTTTC CCACACTCAO 
ACACCACCCT O^OqUCACC QCCAJ^CCCC ACCTCTAATT AAAAAAAATA CAAAAATTAG yTACCTCTGA 



ISKfCCr^i^ yrCTCCTCCC AGCTATTCAG GAOOCTCACG TCCCACA^ y ^ff^yCAQCC T CGAAAGTCGA CCCTGCACTC AATTCTCATC ACACCACTGC 
Figure 8D



ACTtC^CCCT CACtOAC^flA CTAAGACCCT ATCtCAAAAH AOiCmUUUlO AJUUIACACTO 



AACACCAAAT 

cckCktnrt 

AATAACCCAC 
ATTtAAATTT 
ACTACTGTAT 
CTACTXCCCA 
TCAAACTOGT 
GAATCCCTCC 
TCAAAGAGTG 
TGCGMACTC 
CATATCTTTA 
ATGtATTTAA 
TACAACAAAA 
TACCTAAGAA 



AAtCCACTTG 
CAmXACAK 
CTCTTZCCCC 
AAATTAACTA 
TGGACAGTGC 
CACTCAtCAT 
GACAATTTAA 

cAnmcAA 

GCAGAACATC 
TATGTtCCTT 
CTCTTCrrCA 
t AtCCA TCTA 
CATOCTACTT 
CATAACTTCC 



TACCTACTCC 
CCAAGAOCtA 
CCCCTCCCCA 
CAATTAAGAG 
AAGCGGACAT 
ft'ACYACTAA 
CCCACAATCA 
CATAATGGAT 
ACAATCAACC 
TTtACCTCCT 
CAACTCTCtA 
TCTATTCTCT 
TATCACAATT 
CrrTTtAAAA 



TTCCCCGCTO 
CATTtCTAOC 
CCCACGAAGC 
AAAtTAAAAA 
AfiAACACTCr 
TTTCATCCAT 
GGTAATCTCC 
CTAfGCtATT 
tTTTCCCTTT 
CTGITTCIAC 
ATATTCACCT 
CTAATTTTCT 
AtCTTCCTTC 
CtCCAAGATT 



CICTOCACAC 
AAAATCTGOG 
CA0CATC6TC 
trCAATTCCT 

attactgcag 
ttctctactq 
1cagt1ttaa 
agacatatac 

CTtllACCTT 
TTAAtTGCAC 
TCTTTATCAA 
CATTTTCTCT 
GGTTTGTCGG 
CACGAGATCA 



ATTTCTCCAA 
TTGTAACTTA 
CTTATATAAA 
CAATTGCACC 
AAATfTCTAT 
AGCTTAGAAT 
CMCATTGGA 
CT OCCGA TCC 
TGCTTTTTAA 
CCTATGAGCA 
CCTTTCCrCG 
TCTCATCTAT 
ACATAGAACA 
AAACCATCCT 



GCCCAAAGGA 
CCGTAGTCTG 
GCCTACAGCT 
GnGTGCTGT 
TGCCAAATtT 
TGGATAGCAC 
CTCTGCCATT 
ATTTTCTGGG 
CGAGCTACQT 
AAAGACAATA 

CTGcrrcccT 
aacaAtccct 
tttcatpcat 
gtgcacagag 

CCCTAACATA 



AATGAACTTO 

CAAACAACCT 
GAACCCTAAO 
CGAATAGGTA 
YAAGCACATA 
TTATAATAGT 
AGAGACTGAT 
ACTAGCTGT6 
ATAATTCCAC 
TTTGCTCCAC 
TACCTACCAr 
CTTAATGCAA 
TATCTCTCCA 
TAGGGGATCC 
CTCAAACCCC 



ttACAGAAQC 

ACATATGTA6 
ACATTCCOfC 
ACCACTAGOC 
ACAACCACAT 
TTAGTGTAAC 
AAAtTTAAAG 
TATCTATCCA 
AACC AWC TO 
CTGATCTTAT 
AACCCCTTCC 
GCACTCTTAT 
ACrrCCATGO 
AAGAACCCAG 
GTCTCTTCCA 



AAATTAGCCC CCCCTCGTCG CACCCCCCTA TACTCCCAGC ^AC A CCGGAC CCTCACCCAG GACAAT CCCC TGAACCCGCC AGfl gfifiArJ ^F 
CCAGATCCCG CCACTCCACT CCACggTCgc rrmr ^fiA/r#v* ,>^> -J-IJ^, ^^WMm,s^ 



CQOGGTTCAA 
AATTACCTAT 
fCtCAGAAGA 
ACAtATGGCV 
mGCTAGTA 
TTAAAACtCC 
WTGCAATTA 
GGATrmGA 
OAGAAtGGTA 
AACTCAtAAA 
TTCACTCATC 
TAtGCCTTCA 
ATAACAtGGT 
GAGAATATAT 
AAAATACAAA 



CCAGATCCCG CC^CTgCACT CCAGCCTCGG CCACAGACCC AGACTCCGTC TCAAAAAAAA AAAAAAAAAA AAAGlCCAAfi 



AAGGAtGTCT 
AAGGCTCAAT 
T7AGTAACTC 
ATTAmCAA 
C ATCACt CAT 
ACIX WAW C 
TCCACTTCCT 
CASAATCTCT 
CCTGCTGTGA 
CACTGTCACC 
AAATAAAGTG 
AGTCTGTTAA 
GCAAGACGAT 
CAAATGTtTG 
CCAGAGAfCA 
TCATTTCTTA 
CCCCCTAT6G 
CTTAGAAGGT 
CGCGAAAGGA 
GAGTTAGTAT 
AC CGAAACTG 
AATTTGGTTT 
TGGCCCTTTA 
TCATTCTATT 
AACCCTATTA 
TCTCCAT CCA 
TGnCGATTA 
GKTTAAGAC 
TAATCGCACC 
AAAAACCAAA 
CAAATGAATT 
AACGGTTACG 
TCTATAmr 
TTGACAGACT 
GCAGACAtCr 
TATATTTTAC 
CACCAGAAAG 
CC9GG0CTAA 
TTCCAATTGC 
ACCCA GACAC 
CTGAAGTCTG 
TCCCACVCCC 
CCTCCtCATC 
TATGCTArCT 
ACCTCTATTT 
AGATTQACAC 
GCACCTTTGA 
AAGAGACTAC 
CTGAGCCCCA 
CAGGTCTCAA 
CAGATCTGCA 
GCGGGOCAGA 
TTAATAICAY 
GAAAAITGTT 
AAAAAGAGGG 
CTGCCCAACA^ 
AOCCCGAGCT 



CCTTTCTCAG 
TTACTCACAT 
AACCAGACAC 
ATCATTTCAT 
ACATGCATCT 
CTWCCAAAC 
TATGAATGGT 
CCTCATTTCT 
C7AACGCATC 
TAGAAAAGAG 
ATCACTTCGT 
AGAAAAOGTG 
TCTGACCTCC 
TCGGAATTGT 
GAGCAGGCTA 
CCTTATTCCA 
TTCCCGAACG 
tCCGGACAGG 
AGCAGGGTCT 
ATCTCTAGAG 
CAACCCCTCT 
CGATCCCATG 
TCTGAAGTAC 
AGATTAAAAA 
ACCTACAGAA 
CTTCCCTCAG 
GGACACATCT 
tAYAGTAATA 
TGGCTCTACA 
CT CAGCAT CC 
TCCTTTGTAT 
TTGGACrTAA 
GTAGAAGCAT 
ATAACTCAGA 
CA TACCCC AA 

GCcmrccT 

TACTAGAACG 
CTAGCCTACT 
TTAAACAAAT 
AT6CTCTTAT 
ATGAGACAGC 
CTACCACCTG 
CCTACCTACC 
TGGAAGGCGG 
CACGCATGCA 
ATCATCCCTG 
CCGTCTGCAC 
AGAGGAAA6C 
CAAACTATGT 
ACTTAATTAT 
TTTTATAAGG 
CnGAQTTTQ 
GTCATAAG6A 
AACACTGGGT 
AAAtCGCCGC 



TrrACCATTC 
CATTTCCGTT 
CTTATTTTCT 
GACCTCACTG 
GCACACATAC 
GCAAG AAGC T 
TACTCGTTTT 
GGATGGGCCA 
AAGACAAACC 
TGTTTCAAAA 
GAAGAAATCT 
TAGGTGAGCT 
ATTAAGAAAC 
TGACTTAAAG 
AGGGACTCCt 
GGGCmCAC 
CCCTGCCGGC 
AACGGCGTGA 
CTGAAGAAAT 
GTGtACTAAA 
ATTACTAGAT 
CCCATGACCC 
CTGGTTTTTC 
AAAAGAATAC 
AATGICCAGG 
CTTTGGCCTG 
CACTC C CACA 

tcitcacttu 
aagctagtct 

CATCTCTTCC 
ATGAGTGACA 
ACCTT ACGAA 
GTGTCnGlT 
TCTCTTACTC 
ATAGCTAATA 
TGGCAAGGAT 
CTCCAGGAAG 
GAGCTC ACAC 
ATGTTCAGn 
GACCGGCGTA 
CAGAGCAtGT 
GTGCCTTAGC 
TACCATTCCC 
CTTCGCCATG 
ACACA9CCTC 
CTGGGGCCCA 
TCCCCCAGCA 
AAAAGTRTC 
AGCCATTCCr 
GTAAAGAATA 
ATCCCC ACAT 
C7TGCTAGCT 
TATTATCTPC 
TGTATGCACA 
CTCCACTCGT 



TCTCCTTGTC 
TCTCACCCAC 
mCAAGCAG 
CACACTTATA 
ACATGCACAt 
GACCTACTTT 
CAAAAATATG 
GCTCCACCAT 
AAGCAACACA 
TGCTGAtGCT 
CACAAAGAAG 
CTTTGCAAGA 
CCCAAACCAA 
AGCTCTTCTG 
GGGATCCTGT 
CTCAGCTTGC 
CCCATCCTCT 
GGCCAATGGA 
ACTTCAGGAG 
CTAAAACAAG 
AGCTTTCATC 
TC CCACCTG A 

cattttctgt 

AAfGGAAGCC 
GAAATGGTCT 
AAGCTAtCTX 
TAACATGCAA 
AAAAAGCCCT 
GGACAGACAT 
CAGTCAAATG 
GCAAACACrc 
GA AAACC AAA 
GGnTTTCTG 
AGAGCATATG 
ITTTCATAGC 
GTTTGGTCAG 
CAA6TCTTTG 
ATGTCCAATT 
GTAACTATCA 
CTTACCCAGG 
GCTACCTAGG 
CAGCCCTGGC 
ACTCCCCTCC 
TGCCATCTCT 
GCATGAAAGT 
CTCTCCTGTG 
AATATCCCCA 
TArXTTAGTA 
GCCC6GAG9G 
TtCTGGAGAG 
CATTCTACTG 
TTACCAACTG 
TATTAAATGT 
CTGGTCTtTA 
CACGCCTGTA 



ATTCCAGAAA 
COCCI! ITAAA 
AAAACACTAT 
GTTATTGTAC 
ATCrrCACTC 
CCAGAATAGT 
AGATAAATTG 
CTCATCCTTA 
CTGCCGCrtC 
TCATAAGAAC 
AACATAGAGA 
GCCACAACGG 
CCAACAACCA 
TCACTGGGGA 
CCAGCTttGA 
CAGGCTGGAG 
TTGCTACCTC 
AGGAGGTACT 
TAGAA ACACG 
TCTTCAATTG 
AACAGCTCAA 
CAATTCTAAG 
TTTACCATAG 
AACTCAT TAA 
ATTTCTTATT 
TAAAGGTACC 
ACTTATTATA 
CTATTATTCC 
TTAAACAATT 
ACCTAGACCA 
TTTATTCTAC 
GGGTAAGACC 
TATCTCTCAG 
CCtTCCCATT 
TATGATCCTG 
GGGTTGGCAA 
tCAGGAGtCA 
tCCCCCCAAT 
ATACCAGTAT 
GCTTTGCACT 
GACTTCAATC 
TCTCCCTCCA 
CCCACCGGGG 
GCACCAAGAA 
TCCAGTCCCt 
GCTGGCAGGC 
TGCTAGAAGG 
TCITCCATGG 
AGGAClrrXA 
CTTCCTTTTA 
CAATTCGTCC 
ATCTCAGGGG 
CrrTAAAACA 
TTATATTATG 
ATCCCAGCAC 



TCCTGAAACC 
rCCACTCAGc" 



tCAA ATCCCA 
CCAGATGTTT 
GAGATGGTCO 
CTGTTCTCTT 
CCTATTTCAT 
TGTGAAAGAC 
AGTGtATAAA 
ATCTGCACGG 
AGTGGfGAAA 
CTTTCTCJTC 
CTTCACmC 
AAAC GGCAAG 
CTGGGTTGGT 
CAGCAGCGGC 
GACCCTACAG 
CCAAGGGCCA 
CTAAAGCCAA 
TCACTtTCCC 
AAGCTAGAGG 
CATACCGCCA 
AACCGACAGA 
CATGCGCAAA 
G CCTCAC TTC 
GCTTTCCTTA 
CTATTTTTGA 
CTGTACAAGC 
TGTATGAACC 
TATCTCAGAT 
ATCCTCTAAG 
AAGGACTAGG 
AACTT CCCTG 
TGTTGTTCTG 
TCTGAAAGAG 
TTCCCCATTA 
AACGGCCAAA 
AAATAATGCT 
GACTAGCTAC 
ACACTAACCA 
ATAACAGTGT 
GAGACAGGTC 
CCTGCAGCCC 
AACACCTGCG 
ACATGGGCAT 
TGCCCCACAG 
TGGGAATCAC 
ACGGGAtCCV 
AGCAAAATAT 
CACTTTTCtT 
AAACATAAAG 
CCCAGTCCCA 
ACAGACCATC 
ACTTAATGn 
CC ACAATGAT 
TAGTTmrC 
TTTCGGAOCC 



CCATCTCTAC 
CAACATCTCA 



AATACATTTA 
CCCAATCCAT 
TrGTCGTTCT 
TTTGCTCTCA 
CCACATGAAC 
CCTGTCATAC 

ACTCArrrrr 

AGGAAATACT 
ACATTATATA 
AGAGTTGGTT 
AtCTGGAGTA 
ACAACTTCTT 
TACCCAGGTT 
7AGATAGCCC 
AGCCATGTtC 
ACGCAGCCGC 
AGGCACTGGC 
TCCAGATGCC 
GTTAAATGCA 
CGTAGGGAAG 
TTTAAAGAAG 
C TjOGCCC CAA 
GGTCTCTGGC 
TOCTTATATT 
CCTAAAGAAA 
TCTTGCCCTG 
ACAACTTCTT 
GATAAAAATT 
ATTATTTGAt 
CAAACCACAT 
CGtAAGTAGG 
GGCrGGCATT 
GGAAAACA6G 
TZCCCCAACA 
CAttCCAAAA 
CTXCAGACTT 
AtCATAATCT 
GAGGAGAAGG 
TGGCCAAGTT 
GTCCATCTGA 
CAtTTCACTT 
CAATGAGCAC 
AGGAGCAGGG 
CACATTTGGA 
CTCTCCACCT 
AAGGACATCT 
tAAAT GGCAA 

ccTAGcrm 

ATTATCAAGT 
GCCACCAAAT 
CCTGGACCGA 
TACCTCTAAA 
TAGC CCAAAG 
TGTATmTA 
CAAGGC6CGC 



AATCAGAACT 
TAACAATGCA 
TCCGGGAGGG 
AGCCTAGCTA 
TAAGATTACT 
TTCTGCAtTG 
AGACAAAATG 
AGATTTCATT 
tCTAC CTITG 
VCVit'T ATCT 
ATGAACAGAT 
TGTGGACTTA 
GGGCAGCATT 
CAITCAGGGA 
ACCTAGCAC6 
CCCTTGTTCG 
GGCCCGGGCC 
CAGCGATGGG 
CTACACAGGA 
AAATCAAAAC 
CAACAOCGCA 
AAATTCCTCC 
CTATWATTC 
AAGTTGTAGC 
AICTTTAAAA 
TACAGCTAGC 
TTT C CTTAGC 
CAATTAAGAG 
GAAAVGCATT 
CtCTCGGCAT 

cagaataatg 
gtcaatgjUg 

CT CCPATTAG 

tgatgtcttt 
ccaagtagtt 
aaaagaacac 

CTCTGCCCAG 
ACCGTGATAT 
tTATTGATGC 
CCTTTACTCC 
CTCACCACCT 
TGCCACCCCA 
AGAGTTAAGG 
GGGTGAATGG 
TGATTGCCAC 
GGCTCTtAAA 
ATTTTAAAAA 
GAACAAGGGG 
CrtGGAAATT 
AnCTCATAA 
ATTTGGGTGC 
CTTTAGCTCA 
TAAACACTCA 
CAACATTTCA 
CCATCACGAG 



ArrrTAAAAA 

AAA AAGGC GA 
CATGTnCCT 
ACAAGATATA 
AGATCATTTG 
GATCTGTACA 
TTTCCTCCAC 
AAACAGGAAA 
GCAGATCAGA 
AAATATCAAA 
TTCAAATTAG 
TGAACAAACT 
AGGGTGAAAO 
GGGAGCAAAT 
GAGGGCATTT 
TATCCCGTCT 
CGATGGTAGC 
AGCTTCTAAA 
CTCAGAGCZC 
ACAGAAATCA 
CTTTG A ATAT 
WTTGGCTTT 
CACATTTCCG 
TACATTTAAC 
ATATGCATTT 
TGTCTTAGCA 
TACAGAGATT 
GGCCAGGATG 
AAATAACAAC 
TCAATGACTA 
AGCAA GCTGT 
GtlTTACTGA 
AGCATAAATT 
ACTATGACTA 
AAGAACTTGT 
TGTAATATCT 
AACCATATTT 
GOGCTGTOGA 
CATTGCATCT 
TGACAATCAA 
TTTGCATTTT 

tcccgggtgg 
tggtgcccag 
ttggtcaggt 
tggocgcaca 
agtaggccag 
ttctagcctg 
tgtaacaaga 

CCCCACAtTT 
CTGAlTCAGT 
ATTAAGCTTC 
TTAGGAGCAC 
TGCATCTATA 
ATAAATCTTC 
GAATTAAAAG 
CTCGAGAGAT 



GGCAGTCAOe 
AAAAAAAAAA 
AGAGGGTATA 
CAAACAAAGT 
AATGATACAC 
GAATGTTCAA 
GATTCAAAGC 
ACCACC1CCA 
TGAAAGAAAC 
CTGCAGCAAA 
TACTGTTTAG 
CCAOGGTGGG 
AGAAATCGn 
TTGCAAGCAG 
GTTGAttGAA 
GTTCACCTGG 
GCGGTCACGG 
TTCCCACGAG 
GTCGCGCAAG 
CTTCAGAACT 
GTTTTTCTSA 
TAGTGAAAAA 
CTAAAGCTXt 
AAGAACTATT 
AATTTGAXGA 
ACCGATAGTT 
TTTTCCCCAC 
CAATCCTTTC 
TTACACTAAG 
GTGACATGTG 
GTTAACCATT 
ACA9CACAAA 
AATCGCAGGT 
CAGATGTGAA 
ACAAAAATGT 
CCT9GACCCA 
TTAATCCAAA 
CTTAGCCATC 
TCTCATCCAT 
GAATTCTTAA 
TTGGAGTTAC 
TGTTTTGTAA 
TTTCTCCACC 
ACATCCTCTC 
GCACGtGCCC 
CCCCTTGTCe 
TGACAAGGGA 
CACnCCTGG 
TC GCTTGCA A 
TYAtl'l*CTCA 
AGATATATAA 
GATtAGCCCC 
AAATTCTCGA 
AATAAATATA 
AAAAATTTAG 
CAACAGCTAG 
CGACACCATC 



TGAAAAAGGA 
TAAATICAGG 
AGACTCTGT C 
CCACCTTTCT 
CTCGACTGTA 
CCAAAATTCA 
AATGACCCCT 
AAAAGACCTC 
ATCTCAAGGA 
AGGACCrCAA 
C CimiHU T 

ATTOCTGTAiF 



AGTCAGAGCC 
ACAAAAAGAA 
CACTTTAATA 
TCCTATAAAG 
ATTATQCAAC 
AGCACAOCTA 
CCTATTCCTC 
CTAGAAAAAT 
6TCAAGCCAG 
ATCACCAAAA 
TTATCCACCA 



CTTTAATGAC 
CATGCCCCAA 
CAAGGGCAGT 
6TGGATCAAG 
GAATATOCAQ 
AGTTGATArr 
ACTGC CTGCA 
CAGTAG'll'VI 
TGTGTTTCCT 
CAACCCATTC 
AAAACACTAA 



TAAAAATAC A AAAATTAACI GGGCATGGTG GCATGCGCCT CTACTCCCAC CACAATTGCT TCAACCTGGC 
CCACTCCTCT CCAGCCTCGT GACAGCGCAA CACTCCCTCA AAAAAAAAAA AGAGAGGGAG AGCCAGAGTA 



TCACCTTTCT 
GGACCATCTQ 
CGTTCAGAAC 
CCACTT GC tT 
TTGAGATAAT 
TTACGACAAA 
TGCCTATAAG 
TCTCTTTT6A 
AATCTATCTC 
CTCAGATTTG 
rrCATCTGCA 



AGGTCTCCAG 
ATATCCACTT 
TACTCAOCTC 
ACAACTCGAA 
CCA C TrcCCT 
GGCAGCTTAC 
CACACCCCTt 
GTAACATGTA 
CrGTAtCCCC 
CCTAAGCTTA 
AAGCTCACAX 



GTAGGAGGCr 
AAAGTGA9GG 
CTGAAAAGGA 
CTCAAAtCCT 
CTTATCTAAT 
TATCCACCCA 
ATCGAGCCCT 
GCAAAAAAAA 
AtGAAGCAAA 
AGCTTCCCT6 
TTCCAGAAAC 



AGTCCTTCAC 
ACTACCTCGT 
TTTACCAAAT 
CCAAGTCGAT 
AATACCCAGG 
GAGC gCACT A 
TAGCrCTTGC 
TTCCATCATG 
TTTTCCCATC 
TCTCTCATTC 
AITCCAtTTC 



TCTC TAGCAC 
TTCCCTTGTT 
GTTGAGTGTG 
CTAGACATTG 
CTCAATCCCT 
CAATATGGTT 
TTCACAATAT 
TAGCTACAGG 
AGAGAAACTG 
TGTCrT G CTT 
TGCCAGCACC 



CCAGCACACA TATnAmT TTTCTAGAVC AAATGTATTA TGCAGTAAGA GICTTAATTT TCTTITCACA 



ATAGTAGGTG TTCAGTAAAT 
TATGAATGGG TTCATOCCTA 
CCCTCTAGTG TtCACACTTC 
AGATGGAGAA AATATTCATT 
CACTCCTTTC TCCACTTTGC 
AAGAGAGAGT GGAAAGAATG 
TCCAGTTTGA AAAGGGTTTG 
GAACACCCTA GTAACTATTA 
ACTCATGGGG AAAAAATCCA 
TCAATGCAGT TAeATAAATO 
TAGAAG CC AA VATTTBOCCi* 
196

  Val Val Leu Asn Gly
G GTT GTT TTG AAT GGT

Lys Val Asp Ala Phe Cys Gly Gly Ser Ile Val Asn Glu Lys Trp Ile Val Thr Ala Ala His Cys Val Glu Thr Gly Val
AAA GTT GAT GCA TTC TGT GGA GGC TCT ATC GTT AAT GAA AAA TGG ATT GTA ACT GCT GCC CAC TGT GTT GAA ACT GGT GTT

Lys Ile Thr Val Val Ala
AAA ATT ACA GTT GTC GCA G GTAAAT A T ACACAAA O AATAATAATC TGCAGCACCA CTAGCTCTTT AATATGATTC GTACACCATA TTTTACTAAG
Figure 8E



titCTAArAAA ATTGTTCrrC AATAAATTGG GCTAAAGCCA 6AAGGGTCAT AATTTCACAA CCCACGTCGC ACCGTCCTCC AACCA9CCAT 
MAt ACCCCT ATTATCACTC ATTTCAfiTGA GGTACAATTA GTTCTT6ATQ TAOCCATTTC CATACCAGAA CCCCTTCCCA AAAATCA6T6 
aTOCTTTTA TCTCTGGItSC TTGGCACAAC CTGTAGCACG KCTCAGAAA ACAAACATTt GAATTAATGG CCAAATGAGT TTGtOCKAA 
juGGHTACTTe AAATTT06AA AATCTAOGAT AATTCATGAC TAGT66ATTC AXTATCACCA ATGAAACSGCT TATAACACCA TGAGTGAACA 
mfCATAfirC CTGAATGGCT TTTTGGlCtC AAAAATATGC AXTG6CVCTC ATTACATrTA ACCAAAATTA TCACAATATA ACAATGAGAT 

QCCMTTAGQ TCACTCGTCC CAACTAGtCA CTTAGAAAAT CTGTCTATGT GAAATACTOT RGTGACTTA AAATGAAAtT TATRTTAAT 



ACTTCTTTTG 
TCATGTCACC 
AAAACGGOTG 
CAAC CATC^C 
CTtTAACATT 
234

   Gly Glu His Asn Ile Glu Glu Thr Glu His Thr Glu Gln Lys Arg Asn
AG GT  GAA ar  AAT ATT GAG GAG ACA GAA CAT ACA GAG CAA AAG CGA AAT

   Val Ile Arg Ile Ile Pro His His Asn Tyr Asn Ala Ala Ile Asn Lys
   GTG ATT CGA ATT ATT CCT CAC CAC AAC TAC AAT GCA GCT Wr  AAT AAG

   Tyr Asn His Asp Ile Ala Leu Leu Glu Leu Asp Glu Pro Leu Val Leu
   TAC AAC CAT GAC ATT GCC CTT CTG GAA CTG GAC GAA CCC TTA GTG CTA

   Asn Ser Tyr Val Thr Pro Ile Cys Ile Ala Asp Lys Glu Tyr Thr Asn
   AAC AGC TAC GTT ACA CCT ATT TGC m   GCT GAC AAG GAA TAC ACG AAC

   Ile Phe Leu Lys Phe Gly Ser Gly Tyr Val Ser Gly Trp Gly Arg Val
   ATC TTC CTC AAA TTT GGA TCT GGC TAT GTA AGT GGC TGG GGA AGA GTC

   Phe His Lys Gly Arg Ser Ala Leu Val Leu Gln Tyr Leu Arg Val Pro
   TTC CAC AAA GGG AGA TCA GCT TTA GTT CTT CAG TAC CTT AGA GTT CCA

   Leu Val Asp Arg Ala Thr Cys Leu Arg Ser Thr Lys Phe Thr Ile Tyr
   CTT GTT GAC CGA GCC ACA TGT CTT CGA TCT ACA AAG TTC ACC ATC TAT

   Asn Asn Met Phe Cys Ala Gly Phe His Glu Gly Gly Arg Asp Ser Cys
   AAC AAC ATG TTC TGT GCT GGC TTC CAT GAA GGA GGT AGA GAT TCA TGT

   Gln Gly Asp Ser Gly Gly Pro His Val Thr Glu Val Glu Gly Thr Ser
   CAA GGA GAT AGT GGG GGA CCC CAT GTT ACT GAA GTG GAA GGG ACC AGT

   Phe Leu Thr Gly Ile Ile Ser Trp Gly Glu Glu Cys Ala Met Lys Gly
   TTC TTA ACT GGA ATT ATT AGC TGG GGT GAA GAG TGT GCA ATG AAA GGC

   Lys Tyr Gly Ile
   AAA TAT GGA ATA



415 

Tyr Thr Lys Val Ser Arg Tyr Val Asn Trp Ile Lys Glu Lys Thr Lys Leu Thr STOP

TAT ACC AAG GTA TCC CGG TAT GTC AAC TGG ATT AAG GAA AAA ACA AAG CTC ACT TAA TGAAAGATGG ATTTCCAAGG TTAATTCATT



GGMTTGAAA 
A0CO6AC AAT 
CCCCTTGACA 
CC&TTOCATC 
C OCTACCA C 
AlCCTTtTCCA 
RCTCTTATA 
CCWAAAIIOT 
T6IGT6TGTA 
OIGiaTTCA 
VJOGAAAS 
CMfiTTGAAG 

anccAATC 



ATTAAC AGCC 
TTCATATTTT 
AAATTGTGAA 
TTCCCCATCT 
ACTCATGAAG 
AATCCCAATC 
CTTCTGTACA 
TTOGGGGAAA 
TGCGTGTGT6 
C6AAGGCAAG 
AAGAGAACGG 
TTGCCTAGAC 
AfiCCAACTAA 



WIGACCAA CTGACGTATG 
gTCTAT TTO ATCAT Q CATC 
CTTGCTAGTT OCTTTAGTTC 
ICOCCTATAA TCCCAGCACT 



CCTCTCACTA ACTAATCACT TTCCCATCTT 
ACCTGAGCAA ATTGATTAGA AAATCGAACC 
GTTAAATTCT CCACTCTGTC CATCAGATAC 
TCTTTCCTTC TCCAACCAAA ACATCAATCT 
AAAGAACACA GCAGTAGCTG AGAGGCTAAA 
CCCAAATCAC TTTTTCTCTT TCTTACTCCC 
C ACTTA TACA TGTCTATCAA ACCCAGACTT 
AGTTTCTTTC AGAGAGlTAA GTTATTTTAT 
TAGACACACA CGCATACACA CATATAATGG 
ATTGGCATAT CATTGTAACT AAAAAAGCTG 
TTCGTTTGCA ATCTACAGCT AGTACAGACT 
CAGAGGACAT AAGTATCATG TCTCCTTTAA 
C TT C l' CC T I T T CT CC ITTCG TCT TCACCAT 
WlCCLills; TGAATl ftlATA JUfc TCCTCTT 
CGACTACTGA CAAAATCACT CTGACCCTGC 
TTTTAGTCAA TATATTTTTG TCTTCCCATA 
TCTCGAGGCC 



TTCTTAGATT TCAATATATA CATTCTATGA 
ACTAGAGGAA TATAATGTGT TAGGAAATTA 
TATOGrrCTC CACTATGGCA ACTAACTCAC 
TTATTAGTTC tCTATACACT ACAGGATCTT 
ACTCATCAAA AACACTACTC CTTTTCCTCT 
TCTCTCCCTT TTACCCTCCA TGGTCGTTAA 
GCTTCCATAG TGGAGACTT6 CTTTTCAGAA 
ATATATAATA TATATATAAA ATATATAATA 
AAGCAATAAG CCATTCTAAG AGCTTGTATC 
ACATTGACCC AGACATATTG TACTCTTTCT 
TTGAGGAAGA ATTCAACAGT GTGTCTTCAG 
CTAGCATACC CCGAAGTGGA GAAGGGTGCA 
CGAAC AT TTT GATTATAGTT AATCCTTCTA 

CAAGCTGCTG CCTTCTCCTC 
TAAGTATAAA TAAACATATT 



nGTGGATTC 
CCCCAACCTC 

TTTAAATTTC 

AACCTGCGCG CATCACCTCA GCTTAGGACT TTCACGCCAG CCTCCCCAAC ATGGTCAAAC CCTCTCTCTA 



TCATTCCTIT 
CACTCATTTC 
TCAATTTTCC 
TOGTCTACTC 
ACCCTATTCC 
A6GAGAGATG 
CATAGGGATC 
TACAATATAA 
GTTATGGAGG 
AAAAATAATA 
CAGTCTTCAG 
gCAGG CTCAA 
TCTTGAATCT 
£ATTGATGTG 
ACCCCCAGCC 
TTGGCTGGGC 



TTCTCTTTAC 
TAAGGGCCCA 
CTCCTTAGCA 
TATCACAAGG 
TCAATCTTTT 
GGGAGCATCA 
AAGTAA06TC 
ATATATACTG 
TCTGACTACG 
ATAATAATGC 
AGCCAACCAA 
AGGCATAAGT 
TCTA6AGAGT 



AATCAGTCAC 
AGGCCTCACT 
CCAgTGGCTC 



CrAAAXA?AG AACAATTACC TGGGCTTGCT AATCTGCACC TATAATCCCA GCTACTGGSC ACGCTCAGGC AGCACAATCA CTTGAGCCTG CCCACCACCC 
pgTCCCCGAC GTrCCACTCA GACAAGATCC CACCAGTCCA CTCCCCATCC TGGCTGACAC ACTCACACTC TGTCTCAAAC AAAATAAATA AATAAATACA 



TnCTTGAGG 
leCACAGATT 
TCTAACTAAC 
UTCCCTTGC 
ICTATTCATT 
1QCTCAATAA 
JkfMTGCCTC 
ecCTCAGATT 
CTIUGACACT 
kCUM»AGC 
CKCACTAAAC 
UKATTACT 
lAKGTGAAGG 
WGJUCGAAC 
mATAATTG 
flCAGJOGAAG 
MTOCTTTCA 
AMCTQCCTG 



CGTTTCTTGT 
TGATC CTCTG 
TG TGACTAT C 
AATTTTGTTA 
AAATTC TAAT 
CTTGTTAGCT 
GGGTTTAGCT 
TAAC GACGAC 

TCTTCCACCC 
ATGTGG6GCC 
GCTTTCAAAA 
AAAGTGTGCT 
TACTGTCCAA 
CTAAACAAGT 
AGATTGAGGG 
ACTGGAGTAC 
GAATTCTTAC 



TAAATCATTC 
TTATCCAGTT 
GCCTTCAATA 
GTCTCT6ATT 
TCACATTTTA 
GAATCAGAAT 
TTTATGAAAA 
CCTAATTCAT 
TTCCATAGAG 
ATAAATCTCT 
TTTAAGCTCA 
TTACTGTAGA 
AATATAAT6C 
TTlT C l'CT tiC 
TCCGAATGCT 
CAACATAAGG 
ATTAACTCTA 
TAACAATTCT 



ATGGAGAGGC 
TTCATATAAG 
AGTCA CTCTC 
TCATCTTTTT 
TATATATATA 
CCATCTTTAT 
AATGAAAGGA 
GCATTAAACA 
ACCACCCCTT 
GACCCACAGT 
CTAAGAGCCA 
TACTT TCCC T 
AGTCATTTTA 
CAAGCAAAAA 
TACAGTCTAA 
CCAG6CCCAA 
CTGTCTATAT 
CTAATTGACC 



ATCCCAAACA 
GCTGTGTGAG 
C ATTTCC GAA 
ATTTTTATTC 
AATAATGTAT 
CCCACACTAC 
AATTAGTTCT 
CCTTACTTCA 
ACAAAGGCAC 
TCTTTTTGCC 
ATGTCTTCAG 
A AATA CACTA 
ACTTGCTGTT 
AGAAAAGGTG 
TCCAAGAATC 
OATAACACAA 
TTTTAGG6CA 
AAAAOGTGAC 



CCACATTCAA 
CTCTCTCCCA 
TTTGATTTCT 
CTTCATCCAA 
AATGTGTATA 
CAATTACTCT 
CCTTTTGTrC 
AATITCAATG 
CAATGCGAAA 
ATGOGCTCAA 
AGCCAGCCCT 
AAACATGCTG 
TAACTC TGAT 
TTCTTCCTTA 
TCAGAGCTGC 
TGACAGCCAG 
GCTGGGGCAT 
GAAATCAAGG 



CAAAACACTC 
GAGAGGACAG 
CCACTTCTAT 
CA AATACTCA 
AATTGCTTTG 
TGCATCCACT 
ACTAAAGGAA 
ACCAAAGGTC 
CTGGCCTCAG 
AAATGATTGA 
CGCTTCATTC 
C CCTCATA TT 
TGTTTTTAGT 
CTTACCTGAA 
AAGGGCCCTT 
GACTAGAGCT 
TCTGCATTGG 
A6ACCAATAA 



TGAAAAATGT 
TGCTCTCAAT 
AA7GAGAGTA 
AGGACTAATT 
TGAGTGCCTA 
ATCGTGAAAG 
GCGAAGAGAQ 
TGTAGCCTCA 
CACTCCTGTT 
ATTCATCATG 
TACCTAGGGC 
GGAAOTOACA 
TCTTTTGAAT 
CCAAAACAGA 
AAACACCATC 
CAAGTCTCCC 
TGGCAATCCT 
GGTAGCCTTG 



TTTCAAATOC 
CCACCTGAGA 
TTT6ACAGGA 
CCTCTCTOCC 
CTACACTGCT 
AAGGCCACAC 
AAGAGACACT 
GCACTTCAAA 
ATTGGTCTTC 
AGCCACCTGT 
ATTTGCAGTT 
GATTAAAATA 
ATTATTTGTT 
CCAGTTTACA 
CAATCCACTC 
ACCCTGCACT 
CTCAACAACC 
GAAAGCAAGA 



AATATAACAC 
CAGAATTGCG 
TCCTCTCCCA 
AAATACCAAC 
AGACAGTAGT 
TTAAATAAGA 
ATAATTGTCT 
ATTGGTAAAA 
TCTGTGGCAG 
G6CATATTCC 
GCCATATAAG 
6AACTCTTGC 
TTATACTGAC 
AAATTCCCTA 
CACTCATTTA 
TTGAAAGAAT 
CTGOGACTGA 
GTGGC 



30329 
30439 
30539 
30439 
30729 



30824
30907 
30980 
31069 
31150 
31331 
31313 

31399

31499
31599
31699
31799
31899
31999
32099
32199
32299
32399
32499
32599
32699
32799
32899
32999
33099
33199
33299

33399
33499
33599
33699
33799
33899
33999
34099
34199
34299
34399
34499
34599
34699
34799
34899
34999
35094
Figure 9A



1 gaattccgtg gatgtgcttt aaaaccacac ctaacgtttg agcacaagtc tcacgaactg 
61 gcgctacaac ttcatcagaa acgaagtctc caaatctgtc caacgcaaaa acacaaagga 
121 gtctaatgac taagtcttcc aaccacaact gtctgctgcg cccggaaaac aagccggggc 
181 tctggggacc cggggctcag gccgcctcgc tccggcctag ccccgccacc ttagttgtgt 
241 catcccccgg gcatgctgag catccccccg cggctccggc acagacgccc ggacctcagg 
301 tctctgcctc cgcgcggggg cccggccctg tggccggagg gagcggccgg atggagcgga
361 ggatgaaagg cggatacttg gaccagcgag tgccctacac cttctgcagc aaatctcccg 
421 gaaatgggag cttgggcgaa gcgctgatgg tcccgcaggg aaagctcatg gacccgggct 
481 ccctgccgcc ttccgactca gaagatctct tccaggacct cagtcacttc caagagacgt 
541 ggctcgcaga agctcaggta ccggacagtg atgagcagtt tgttcctgat ttccattcag 
601 aaaacttagc tttccatagc cccaccacca ggatcaagaa ggaaccccag agtccccgca 
661 cagaccccgc cctgtcctgc agcaggaagc caccactccc ctaccaccat ggagagcagt 
721 gcctttactc cagacaaatc gccatcaagt cccccgctcc cggtgcccct ggacagtcgc 
781 ccctgcagcc cttttccagg gcagaacagc agcagagcct cctgagagcc tccagctctt 
841 cccagtccca ccctggccac gggtaccttg gtgagcacag ctccgtcttc cagcagcccg 
901 tggacatgtg ccactccttc acatctcctc agggaggggg ccgggaacct ctcccagccc 
961 cctatcaaca ccaactgtcg gagccctgcc caccctaccc ccagcagaac ttcaagcagg 
1021 agtaccatga ccccctgtac gaacaggctg gccagcccgc ttcaagccag ggtggggtca 
1081 gtgggcacag gtacccaggg gcgggggtgg tgatcaaaca ggagcgcaca gacttcgcct
1141 acgactcaga tgtccctgga tgtgcatcaa tgtacctcca cccagagggc ttctctggac 
1201 cctctccagg tgatggagtg atgggttatg gctatgaaaa atcccttcga ccattcccag 
1261 atgatgtctg cattgtccct aaaaaatttg aaggagacat caagcaggaa gggattggag 
1321 ctttccggga ggggccaccc taccagcgcc ggggtgcctt acaactgtgg cagtttctgg 
1381 tggccctgct ggatgaccca acaaatgctc atttcattgc ttggacaggc cggggaatgg
1441 agtttaaact aattgaacct gaagaggttg ccaggctctg gggtatccag aagaaccggc 
1501 cagccatgaa ttatgacaag ctgagccgct cgctgcgata ctattatgag aaaggcatca 
1561 tgcagaaggt ggctggcgaa cgctacgtgt acaagtttgt gtgcgagccg gaggccctgt 
1621 tctctctggc cttcccagat aatcaacgtc cagctctgaa ggctgagttt gaccggccag 
1681 tcagtgagga ggacacagtc cctttgtccc acttggatga gagtcctgcc tacctcccag 
1741 aactcactgg ccccgctccg cccttcggcc acagaggtgg atattcttac taggcaccag 
1801 tggcttcccc ttgacatggt ggggttgctc agtgtatata tcaactgatt tggtattggt 
1861 gaaggccctc tttctgatgc ctgtagaagt ctctggggtc agagctccac tatcccatct 
1921 gatactcctg gccagactca gctgctaacc agagtctgcg ggaaagacag tggaggcagg 
1981 ccaaatctaa aggcagtagc tgaagttcgc tgtggctcac ctgtaccttc agttcagctt 
2041 ggcctctgcc taggtcttgc tcagaggcca agttcctcac ccccaccaca gagatccagt 
2101 gttctattct ggggacatac agggacttcc cttgtttatt atggcaacag ggccaagggg 
2161 attctcagaa caccctgtgt ctcccctctc ccaacccccc atgggagaca aagttctgcc 
2221 tggcttctgc cctgaacagg ggggtcctgt gttcttggtg ctgtgctctg ggaggcagga 
2281 gcatgtgggc ggcagctggg ggggggtgtg gaagtagaga tggctctctg ccctaggcct 
2341 acccaggcct aattccacct ttgcctctta tgccagacct taataaagcc tctgcttctc 
2401 cccggaattc 
Figure 9B



MTKSSNHNCLLRPENKPGLWGPGAQAASLRPSPATLVVSSPGHA

EHPPAAPAQTPGPQVSASARGPGPVAGGSGRMERRMKGGYLDQRVPYTFCSKSPGNGS

LGEALMVPQGKLMDPGSLPPSDSEDLFQDLSHFQETWLAEAQVPDSDEQFVPDFHSEN

LAFHSPTTRIKKEPQSPRTDPALSCSRKPPLPYHHGEQCLYSRQIAIKSPAPGAPGQS

PLQPFSRAEQQQSLLRASSSSQSHPGHGYLGEHSSVFQQPVDMCHSFTSPQGGGREPL

PAPYQHQLSEPCPPYPQQNFKQEYHDPLYEQAGQPASSQGGVSGHRYPGAGVVIKQER

TDFAYDSDVPGCASMYLHPEGFSGPSPGDGVMGYGYEKSLRPFPDDVCIVPKKFEGDI

KQEGIGAFREGPPYQRRGALQLWQFLVALLDDPTNAHFIAWTGRGMEFKLIEPEEVAR

LWGIQKNRPAMNYDKLSRSLRYYYEKGIMQKVAGERYVYKFVCEPEALFSLAFPDNQR

PALKAEFDRPVSEEDTVPLSHLDESPAYLPELTGPAPPFGHRGGYSY
Figure 10 (A)



1 gaattccagg ttggaggggc ggcaacctcc tgccagcctt caggccactc tcctgtgcct 
61 gccagaagag acagagcttg aggagagctt gaggagagca ggaaaggtgg aacattgctg 
121 ctgctgctca ctcagttcca caggtgggag gaacagcagg gcttagagtg ggggtcattg 
181 tgcagatggg aaaacaaagg cccagagagg ggaagaaatg cctaggagct accgagggca 
241 ggcgacctca accacagccc agtgctggag ctgtgagtgg atgtagagca gcggaatatc 
301 cattcagcca gctcagggga aggacagggg ccctgaagcc aggggatgga gctgcaggga 
361 agggagctca gagagaaggg gaggggagtc tgagctcagt ttcccgctgc ctgaaaggag 
421 ggtggtacct actcccttca cagggtaact gaatgagaga ctgcctggag gaaagctctt 
481 caagtgtggc ccaccccacc ccagtgacac cagcccctga cacgggggag ggagggcagc 
541 atcaggaggg gctttctggg cacacccagt acccgtctct gagctttcct tgaactgttg 
601 cattttaatc ctcacagcag ctcaacaagg tacataccgt caccatcccc attttacaga 
661 tagggaaatt gaggctcgga gcggttaaac aactcacctg aggcctcaca gccagtaagt 
721 gggttccctg gtctgaatgt gtgtgctgga ggatcctgtg ggtcactcgc ctggtagagc 
781 cccaaggtgg aggcataaat gggactggtg aatgacagaa ggggcaaaaa tgcactcatc 
841 cattcactct gcaagtatct acggcacgta cgccagctcc caagcaggtt tgcgggttgc 
901 acagcggagc gatgcaatct gatttaggct tttaaaggat tgcaatcaag tgggacccac 
961 tagcctcaac cctgtacctc ccctcccctc cacccccagc agtctccaaa ggcctccaac 
1021 aaccccagag tgggggccat gtatccaaag aaactccaag ctgtatacgg atcacactgg 
1081 ttttccagga gcaaaaacag aaacagcctg aggctggtca aaattgaacc tcctcctgct 
1141 ctgagcagcc tagggggcag actaagcaga gggctgtgca gacccacata aagagcct:ac 
1201 tgtgtgccag gcacttcacc cgaggcactt cacaagcatg cttgggaatg aaacttccaa 
1261 ctctttggga tgcaggtgaa acagttcctg gttcagagag gtgaagcggc ctgcctgagg 
1321 cagcacagct cttctttaca gatgtgcttc cccacctcta ccctgtctca cggcccccca 
1381 tgccagcctg acggttgtgt ctgcctcagt catgctccat ttttccatcg ggaccatcaa 
1441 gagggtgttt gtgtctaagg ctgactgggt aactttggat gagcggtctc tccgctccga 
1501 gcctgtttcc tcatctgtca aacgggctct aacccactct gatctcccag ggcggcagta 
1561 agtcttcagc atcaggcatt ttggggtgac tcagtaaatg gtagatcttg ctaccagtgg 
1621 aacagccact aaggattctg cagtgagagc agagggccag ctaagtggta ctctcccaga 
1681 gactgtctga ctcacgccac cccctccacc ttggacacag gacgctgtgg tttctgagcc 
1741 aggtacaatg actcctttcg gtaagtgcag tggaagctgt acactgccca ggcaaagcgt 
1801 ccgggcagcg taggcgggcg actcagatcc cagccagtgg acttagcccc tgtttgctcc 
1861 tccgataact ggggtgacct tggttaatat tcaccagcag cctcccccgt tgcccctctg 
1921 gatccactgc ttaaatacgg acgaggacag ggccctgtct cctcagcttc aggcaccacc 
1981 actgacctgg gacagtgaat cgtaagtatg cctttcactg cgaggggttc tggagaggct 
2041 tccgagctcc ccatggccca ggcaggcagc aggtctgggg caggaggggg gttgtggagt 
2101 gggtatccgc ctgctgaggt gcagggcaga tggagaggct gcagctgagc tcctattttc 
2161 ataataacag cagccatgag ggttgtgtcc tgtttcccag tcctgcccgg tcccccctcg 
2221 gtacctcctg gtggatacac tggttcctgt aagcagaagt ggatgagggt gtctaggtct 
2281 gcagtcctgg caccccagga tgggggacac cagccaagat acagcaacag caacaaagcg
2341 cagccatttc tttctgtttg cacagctcct ctgtctgtcg ggggctcctg tctgttgtct 
2401 cctataagcc tcaccacctc tcctactgct tgggcatgca tctttctccc cttctataga
2461 tgaggaggtt aaggttcaga gaggggtggg gaggaacgcc ggctcacatt ctccatcccc 
2521 tccagatatg accaggaaca gacctgtgcc agcctcagcc ttacatcaaa atgggcctcc 
2581 ccatgcaccg tggacctctg ggccctcctg tcccagtgga ggacaggaag ctgtgagggg 
2641 cactgtcacc cagggctcaa gctggcattc ctgaataatc gctctgcacc aggccacggc 
2701 taagctcagt gcgtgattaa gcctcataac cctccaaggc agttactagt gtgattccca 
2761 ttttacagat gaggaagatg gggacagaga ggtgaataac tggccccaaa tcacacacca 
2821 tccataattc gggctcaggc acctggctcc agtccccaaa ctcttgaacc tggccctagt 
2881 gtcactgttt ctcttgggtc tcaggcgctg gatggggaac aggaaacctg ggctgaactt 
2941 gaggcctctc tgatgctcgg tgacttcaga cagttgctca acctctctgt tctcttgggc 
3001 aaaacatgat aacctttgac ttctgtcccc tcccctcacc ccacccgacc ttgatctctg 
3061 aagtgttgga aggatttaat ttttcctgca ctgagttttg gagacaggtc aaaaagatga 
3121 ccaaggccaa ggtggccagt ttcctataga acgcctctaa aagacctgca gcaatagcag 
3181 caagaactgg tattctcgag aacttgctgc gcagcaggca cttcttggca ttttatgtgt 
3241 atttaatttc acaatagctc tatgacaaag tccacctttc tcatctccag gaaactgagg 
3301 ttcagagagg ttaagtaact tgtccaaggt cacacagcta atagcaagtt gacgtggagc 
3361 aatctggcct cagagccttt aattttagcc acagactgat gctcccctct tcatttagcc 
3421 aggctgcctc tgaagttttc tgattcaaga cttctggctt cagctttgta cacagagatg 
3481 attcaatgtc aggttttgga gcgaaatctg tttaatccca gacaaaacat ttaggattac
3541 atctcagttt tgtaagcaag tagctctgtg atttttagtg agttatttaa tgctctttgg 
3601 ggctcaattt ttctatctat aaaatagggc taataatttg caccttatag ggtaagcttt 
3661 gaggacagat tagatgatac ggtgcctgta aaacaccagg tgttagtaag tgtggcaatg 
3721 atggtgacgc tgaggctgtg tttgcttagc atagggttag gcagctggca ggcagtaaac 
3781 agttggataa tttaatggaa aatttgccaa actcagatgc tgttcactgc tgagcaggag 
3841 ccccttcctg ctgaaatggt cctggggagt gcagcaggct ctccgggaag aaatctacca 
3901 tctctcgggc aggagctcaa cctgtgtgca ggtacaggga gggcttcctc acctggtgcc 
3961 cactcatgca ttacgtcagt tattcctcat ccctgtccaa aggattcttt tctccattgt 
4021 acagctatga agctagtgct caaagaagtg aagtcattta ccccaggccc cctgccagta 
4081 agtgacaggg cctggtcaca cttgggttta tttattgccc agttcaacag gttgtttgac 
4141 cataggcgag attctcttcc ctgcaccctg ccgggttgct cttggtccct tattttatgc 
4201 tcctgggtag aaatggtgcg agattaggca gggagtggac gcttccctgt ccctggcccc 
4261 gcaaagagtg ctcccacctg ccccgatccc agaaatgtca ccatgaagcc ttcattcttt 
4321 tggtttaaag cttggcctca gtgtccgtac accatggggt ccttggccag atggcgactt 
4381 tctcctctcc agtcgccctc ccaggcacta gcttttagga gtgcagggtg ctgcctctga 
4441 tagaagggcc aggagagagc aggttttgga gacctgatgt tataaggaac agcttgggag 
4501 gcataatgaa cccaacatga tgcttgagac caatgtcaca gcccaattct gacattcatc 
4561 atctgagatc tgaggacaca gctgtctcag ttcatgatct gagtgctggg aaagccaaga 
4621 cttgttccag ctttgtcact gacttgctgt atagcctcaa caaggccctg accctctctg
4681 ggcttcaaac tcttcactgt gaaaggagga aaccagagta ggtgatgtga caccaggaaa 
4741 gatggatggg tgtgggggaa tgtgctcctc ccagctgtca ccccctcgcc accctccctg 
4801 caccagcctc tccacctcct ttgagcccag aattcccctg tctaggaggg cacctgtctc 
4861 gtgcctagcc atgggaattc tccatctgtt ttgctacatt gaacccagat gccattctaa 
4921 ccaagaatcc tggctgggtg caggggctct cgcctgtaac cccagcactt tgggaggcca 
4981 aggcaggcgg atcaagaggt caggagttca agacctgcct ggccaacacg gtgaaacctc 
5041 agctctacta aaaatacaaa aattagccag gcgtggtggc acacgcctgt aatcccagct 
5101 atttgggaag ctgagacaga agaatttctt gaacccggga ggtggaggtt tcagtgagcc 
5161 gagatcacgc cactgcactc caccctggcg gataaagcga gactctgtct caaaaaaaac 
5221 ccaaaaacct atgttagtgt acagagggcc ccagtgaagt cttctcccag ccccactttg 
5281 cacaactggg gagagtgagg ccccaggacc agaggattct tgctaaaggc caagtggata 
5341 gtgatggccc tgccaggcta gaagccacaa cctctggccc tgaggccact cagcatattt 
5401 agtgtcccca ccctgcagag gcccaactcc ctcctgacca ctgagccctg taatgatggg 
5461 ggaatttcca taagccatga aggactgcac aaagttcagt tgggagtgaa agagaaatta 
5521 aagggagatg gaaatataca gcactaattt tagcaccgtc ttcagttcta acaacactag 
5581 ctagctgaag aaaatacaaa catgtattat gtaatgtgtg gtctgttcca tttggattac 
5641 ttagaggcac gagggccaag gagaaaggtg gtggagagaa accagctttg cacttcattt 
5701 gttgctttat tggaaggaaa cttttaaaag tccaaggggg ttgaagaatc tcaatatttg 
5761 ttatttccag ctttttttct ccagtttttc atttcccaaa ttcaaggaca cctttttctt 
5821 tgtattttgt taagatgatg gttttggttt tgtgactagt agttaacaat gtggctgccg 
5881 ggcatattct cctcagctag gacctcagtt ttcccatctg tgaagacggc aggttctacc 
5941 tagggggctg caggcaggtg gtccgaagcc tgggcatatc tggagtagaa ggatcactgt 
6001 ggggcagggc aggttctgtg ttgctgtgga tgacgttgac tttgaccatt gctcggcaga 
6061 gcctgctctc gctggttcag ccacaggccc caccactccc tattgtctca gccccgggta 
6121 tgaaacatgt attcctcact ggcctatcac ctgaagcctt tgaatttgca acacctgcca 
6181 acccctccct caaaagagtt gccctctcta gatccttttg atgtaaggtt tggtgttgag 
6241 acttatttca ctaaattctc atacataaac atcactttat gtatgaggca aaatgaggac
6301 cagggagatg aatgacttgt cctggctcat acacctggaa agtgacagag tcagattaga 
6361 tcctaggtct atctgaagtt aaaagaggtg tcttttcact tcccacctcc tccatctact 
6421 ttaaagcagc acaaacccct gctttcaagg agagatgagc gtctctaaag cccctgacag 
6481 caagagccca gaactgggac accattagtg acccagacgg caggtaagct gactgcagga 
6541 gcatcagcct attcttgtgt ctgggaccac agagcattgt ggggacagcc ccgtctcttg 
6601 ggaaaaaaac cctaagggct gaggatcctt gtgagtgttg ggtgggaaca gctcccagga 
6661 ggtttaatca cagcccctcc atgctctcta gctgttgcca ttgtgcaaga tgcatttccc 
6721 ttctgtgcag cagtttccct ggccactaaa tagtgggatt agatagaagc cctccaaggg 
6781 ctccagcttg acatgattct tgattctgat ctgacccgat tctgataatc gtgggcaggc 
6841 ccattcctct tcttgtgcct cattttcttc ttttgtaaaa caatggctgt accatttgca 
6901 tcttagggtc attgcagatg aaagtgttgc tgtccagagc ctgggtgcag gacctagatg 
6961 taggattctg gttctgctac ttcctcagtg acattgaata gctgacctaa tctctctggc 
7021 tttggtttct tcatctgtaa aagaaggata ttagcattag cacctcacgg gattgttaca 
Figure 10 (C)



7081 agaaagcaat gaattaacac atgtgagcac ggagaacagt gcttggcata tggtaagcac 
7141 tacgtacatt ttgctattct Cctgattctt tcagtgttac tgatgtcggc aagtacttgg
7201 cacaggctgg tttaataatc cctaggcact ttcacgtggt gtcaatccct giatcactggg
7261 agtcatcatg tgccttgact cgggcctggc ccccccatct ctgtcttgca ggacaatgcc 
7321 gtcttctgtc tcgtggggca tcctcctgct ggcaggcctg tgctgcctgg tccctgtctc 
7381 cctggctgag gatccccagg gagatgctgc ccagaagaca gatacatccc accatgatca
7441 ggatcaccca accttcaaca agatcacccc caacctggct gagttcgcct tcagcctata 
7501 ccgccagctg gcacaccagt ccaacagcac caatatcttc ttctccccag tgagcatcgc
7561 tacagccttt gcaatgctct ccctggggac caaggctgac actcacgatg aaatcctgga 
7621 gggcctgaat ttcaacctca cggagattcc ggaggctcag atccatgaag gcttccagga 
7681 actcctccgt accctcaacc agccagacag ccagctccag ctgaccaccg gcaatggcct 
7741 gttcctcagc gagggcctga agctagtgga taagtttttg gaggatgtta aaaagttgta 
7801 ccactcagaa gccttcactg tcaacttcgg ggacaccgaa gaggccaaga aacagatcaa 
7861 cgattacgtg gagaagggta ctcaagggaa aattgtggat ttggtcaagg agcttgacag 
7921 agacacagtt tttgctctgg tgaattacat cttctttaaa ggtaaggttg ctcaaccagc 
7981 ctgagctgtt tcccatagaa acaagcaaaa atatttctca aaccatcagt tcttgaactc 
8041 tccttggcaa tgcattatgg gccatagcaa tgcttttcag cgtggattct tcagttttct 
8101 acacacaaac actaaaatgt tttccatcat tgagtaattt gaggaaataa tagattaaac 
8161 tgtcaaaact actgacgctc tgcagaactt ttcagagcct ttaatgtcct tgtgtatact 
8221 gtatatgtag aatatataat gcttagaact atagaacaaa ttgtaataca ctgcataaag 
8281 ggatagtttc atggaacata ctttacacga ctctagtgtc ccagaatcag tatcagtttt 
8341 gcaatctgaa agacctgggt tcaaaccctg cctctaacac aattagcttt tgacaaaaac 
8401 aatgcattct acctctttga ggtgctaatt tctcatctta gcatggacaa aataccattc 
8461 ttgctgtcag gtt:ttt:t:t:ag gattaaacaa atgacaaaga ctgtggggat ggtgtgtggc 
8521 atacagcagg tgatggactc ttctgtatct caggctgcct tcctgcccct gaggggttaa
8581 aatgccaggg tcctgggggc cccagggcat tctaagccag ctcccactgt cccaggaaaa
8641 cagcataggg gaggggaggt gggaggcaag gccaggggct gcttcctcca ctctgaggct 
8701 cccttgctct tgaggcaaag gagggcagtg gaggcaagcc aggctgcagt cagcacagct 
8761 aaagtcctgg ctctgctgtg gccttagtgg gggcccaggt ccctctccag ccccagtctc 
8821 ctccttctgt ccaatgagaa agctgggatc aggggtccct gaggcccctg tccactctgc 
8881 atgcctcgat ggtgaagctc tgttggtatg gcagagggga ggctgctcag gcatctgcat 
8941 ttcccctgcc aatctagagg atgaggaaag ctctcaggaa tagtaagcag aatgtttgcc
9001 ctggatgaat aactgagctg ccaattaaca aggggcaggg agccttagac agaaggtacc
9061 aaatatgcct gatgctccaa cattttattt gtaatatcca agacaccctc aaataaacat 
9121 atgattccaa taaaaatgca cagccacgat ggcatctctt agcctgacat cgccacgatg 
9181 tagaaattct gcatcttcct ctagttttga attatcccca cacaatcttt ttcggcagct 
9241 tggatggtca gtttcagcac cttttacaga tgatgaagct gagcctcgag ggatgtgtgt 
9301 cgtcaagggg gctcagggct tctcagggag gggactcatg gtttcttatt ctgctacact 
9361 cttccaaacc ttcactcacc cctggtgatg cccaccttcc cctctctcca ggcaaatggg 
9421 agagaccctt tgaagtcaag gacaccgagg aagaggactt ccacgtggac caggtgacca 
9481 ccgtgaaggt gcctatgatg aagcgtttag gcatgtttaa catccagcac tgtaagaagc 
9541 tgtccagctg ggtgctgctg atgaaatacc tgggcaatgc caccgccatc ttcttcctgc 
9601 ctgatgaggg gaaactacag cacctggtaa atgaactcac ccacgatatc atcaccaagt 
9661 tcctggaaaa tgaagacaga aggtgattcc ccaacctgag ggtgaccaag aagctgccca
9721 cacctcttag ccatgttggg actgaggccc atcaggactg gccagagggc tgaggagggt 
9781 gaaccccaca tccctgggtc actgctactc tgtataaact tggcttccag aatgaggcca
9841 ccactgagtt caggcagcgc cgtccatgct ccatgaggag aacagtaccc agggtgagga 
9901 ggtaaaggtc tcgtccctgg gaacttccca ctccagtgtg gacactgtcc cttcccaata
9961 tccagtgccc aaggcaggga cagcagcacc accacacgtt ctggcagaac caaaaaggaa 
10021 cagatgggct tcctggcaaa ggcagcagtg gagtgtggag ttcaagggta gaatgtccct
10081 ggggggacgg gggaagagcc tgtgtggcaa ggcccagaaa agcaaggttc ggaattggaa 
10141 cagccaggcc atgttcgcag aaggcttgcg tttctctgtc actttatcgg tgctgttaga 
10201 ttgggtgtcc tgtagtaagt gatacttaaa catgagccac acattagtgt atgtgtgtgc 
10261 attcgtgatt atgcccatgc cctgctgatc tagttcgttt tgtacactgt aaaaccaaga 
10321 tgaaaataca aaaggtgtcg ggttcataat aggaatcgag gctggaattt ctctgttcca 
10381 tgccagcacc tcctgaggtc tctgctccag gggttgagaa agaacaaaga ggctgagagg 
10441 gtaacggatc agagagccca gagccagctg ccgctcacac cagaccctgc tcagggtggc 
10501 attgtctccc catggaaaac cagagaggag cactcagcct ggtgtggtca ctcttctctt 
10561 atccactaaa cggttgtcac tgggcactgc caccagcccc gtgtttctct gggtgtaggg 
10621 ccctggggat gttacaggct gggggccagg tgacccaaca ctacagggca agatgagaca 



WO 00/75279 PCT/US00/15728

18/35



Figure 10 (D) 



10681 ggcttccagg acacctagaa tatcagagga ggtggcattt caagcttttg tgattcattc 
10741 gatgttaaca ttctttgact caatgtagaa gagctaaaag tagaacaaac caaagccgag 
10801 ttcccatctt agtgtgggtg gaggacacag gagtaagtgg cagaaataat cagaaaagaa 
10861 aacacttgca ctgtggtggg tcccagaaga acaagaggaa tgctgtgcca tgccttgaat 
10921 ttcttttctg cacgacaggt ctgccagctt acatttaccc aaactgtcca ttactggaac 
10981 ctatgatctg aagagcgtcc tgggtcaact gggcatcact aaggtcttca gcaatggggc 
11041 tgacctctcc ggggtcacag aggaggcacc cctgaagctc tccaaggtga gatcaccctg 
11101 acgaccttgt tgcaccatgg tatctgtagg gaagaatgtg tgggggctgc agcactgtcc 
11161 tgaggctgag gaaggggccg agggaaacaa atgaagaccc aggctgagct cctgaagatg 
11221 cccgtgattc actgacacgg gacggtgggc aaacagcaaa gccaggcagg ggctgctgtg 
11281 cagctggcac tttcggggcc tcccttgagg ttgtgtcact gaccctgaat ttcaactttg 
11341 cccaagacct tctagacatt gggccttgat ttatccatac tgacacagaa aggtttgggc 
11401 taagttgttt caaaggaatt tctgactcct tcgatctgtg agatttggtg tctgaattaa 
11461 tgaatgattt cagctaaagt gacacttatt ttggaaaact aaaggcgacc aatgaacaac 
11521 ctgcagttcc atgaatggct gcattatctt ggggtctggg cactgtgaag gtcactgcca 
11581 gggtccgtgt cctcaaggag cttcaagccg tgtactagaa aggagagagc cctggaggca 
11641 gacgtggagt gacgatgctc ttccctgttc tgagttgtgg gtgcacctga gcagggggag 
11701 aggcgcttgt caggaagatg gacagagggg agccagcccc atcagccaaa gccttgagga 
11761 ggagcaaggc ctatgtgaca gggagggaga ggatgtgcag ggccagggcc gtccaggggg 
11821 agtgagcgct tcctgggagg tgtccacgtg agccttgctc gaggcctggg atcagcctta 
11881 caacgtgtct ctgcttctct cccctccagg ccgtgcataa ggctgtgctg accatcgacg 
11941 agaaagggac tgaagctgct ggggccatgt ttttagaggc catacccatg tctatccccc
12001 ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaacaaaat accaagtctc 
12061 ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac
12121 ccctcccctc catccctggc cccctccctg gatgacatta aagaagggtt gagctggtcc 
12181 ctgcctgcat gtgatctgta aatccctggg atgttttctc tg 
Figure 11



1 caccagcatc atctcctcca attcatccag ctactctgcc catgaagata atagttttca
61 ggcggattgc ctcagatcac actatctcca cttgcccagc cctgtggaag attagcggcc
121 atgtattcca atgtgatagg aactgtaacc tctggaaaaa ggaaggttta tcttttgtcc
181 ttgctgctca ttggcttctg ggactgcgtg acctgtcacg ggagccctgt ggacatctgc
241 acagccaagc cgcgggacat tcccatgaat cccatgtgca tttaccgctc cccggagaag
301 aaggcaactg aggatgaggg ctcagaacag aagatcccgg aggccaccaa ccggcgtgtc
361 tgggaactgt ccaaggccaa ttcccgcttt gctaccactt tctatcagca cctggcagat
421 tccaagaatg acaatgataa cattttcctg tcacccctga gtatctccac ggcttttgct
481 atgaccaagc tgggtgcctg taatgacacc ctccagcaac tgatggaggt atttaagttt
541 gacaccatat ctgagaaaac atctgatcag atccacttct tctttgccaa actgaactgc
601 cgactctatc gaaaagccaa caaatcctcc aagttagtat cagccaatcg cctttttgga
661 gacaaatccc ttaccttcaa tgagacctac caggacatca gtgagttggt atatggagcc
721 aagctccagc ccctggactt caaggaaaat gcagagcaat ccagagcggc catcaacaaa
781 tgggtgtcca ataagaccga aggccgaatc accgatgtca ttccctcgga agccatcaat
841 gagctcactg ttctggtgct ggttaacacc atttacttca agggcctgtg gaagtcaaag
901 ttcagccctg agaacacaag gaaggaactg ttctacaagg ctgatggaga gtcgtgttca
961 gcatctatga tgtaccagga aggcaagttc cgttatcggc gcgtggctga aggcacccag
1021 gtgcttgagt tgcccttcaa aggtgatgac atcaccatgg tcctcatctt gcccaagcct
1081 gagaagagcc tggccaaggt ggagaaggaa ctcaccccag aggtgctgca ggagtggctg
1141 gatgaattgg aggagatgat gctggtggtt cacatgcccc gcttccgcat tgaggacggc
1201 ttcagtttga aggagcagct gcaagacatg ggccttgtcg atctgttcag ccctgaaaag
1261 tccaaactcc caggtattgt tgcagaaggc cgagatgacc tctatgtctc agatgcattc
1321 cataaggcat ttcttgaggt aaatgaagaa ggcagtgaag cagctgcaag taccgctgtt
1381 gtgattgctg gccgttcgct aaaccccaac agggtgactt tcaaggccaa caggcccttc
1441 ctggttttta taagagaagt tcctctgaac actattatct tcatgggcag agtagccaac
1501 ccttgtgtta agtaaaatgt tcttattctt tgcacctctt cctatttttg gtttgtgaac
1561 agaagtaaaa ataaatacaa actacttcca tctcacatt
1 ctgcaggggg gggggggggg gggggctgtc atggcggcag gacggcgaac ttgcagtatc
61 tccacgaccc gcccctacag gtgccagtgc ctccagaatg tggcagctca caagcctcct
121 gctgttcgtg gccacctggg gaatttccgg cacaccagct cctcttgact cagtgttctc
181 cagcagcgag cgtgcccacc aggtgctgcg gatccgcaaa cgtgccaact ccttcctgga 
241 ggagctccgt cacagcagcc tggagcggga gtgcatagag gagatctgtg acttcgagga 
301 ggccaaggaa attttccaaa atgtggatga cacactggcc ttctggtcca agcacgtcga 
361 cggtgaccag tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca 
421 cggcacgtgc atcgacggca tcggcagctt cagctgcgac tgccgcagcg gctgggaggg 
481 ccgcttctgc cagcgcgagg tgagcttcct caattgctcg ctggacaacg gcggctgcac
541 gcattactgc ctagaggagg tgggctggcg gcgctgtagc tgtgcgcctg gctacaagct 
601 gggggacgac ctcctgcagt gtcaccccgc agtgaagttc ccttgtggga ggccctggaa 
661 gcggatggag aagaagcgca gtcacctgaa acgagacaca gaagaccaag aagaccaagt 
721 agatccgcgg ctcattgatg ggaagatgac caggcgggga gacagcccct ggcaggtggt 
781 cctgctggac tcaaagaaga agctggcctg cggggcagtg ctcatccacc cctcctgggt 
841 gctgacagcg gcccactgca tggatgagtc caagaagctc cttgtcaggc ttggagagta 
901 tgacctgcgg cgctgggaga agtgggagct ggacctggac atcaaggagg tcttcgtcca 
961 ccccaactac agcaagagca ccaccgacaa tgacatcgca ctgctgcacc tggcccagcc 
1021 cgccaccctc tcgcagacca tagtgcccat ctgcctcccg gacagcggcc ttgcagagcg 
1081 cgagctcaat caggccggcc aggagaccct cgtgacgggc tggggctacc acagcagccg 
1141 agagaaggag gccaagagaa accgcacctt cgtcctcaac ttcatcaaga ttcccgtggt 
1201 cccgcacaat gagtgcagcg aggtcatgag caacatggtg tctgagaaca tgctgtgtgc
1261 gggcatcctc ggggaccggc aggatgcctg cgagggcgac agtggggggc ccatggtcgc 
1321 ctccttccac ggcacctggt tcctggtggg cctggtgagc tggggtgagg gctgtgggct
1381 ccttcacaac tacggcgttt acaccaaagt cagccgctac ctcgactgga tccatgggca 
1441 catcagagac aaggaagccc cccagaagag ctgggcacct tagcgaccct ccctgcaggg 
1501 ctgggctttt gcatggcaat ggatgggaca ttaaagggac atgtaacaag cacaccggcc 
1561 tgctgttctg tccttccatc cctcttttgg gctcttctgg agggaagtaa catttactga
1621 gcacctgttg tatgtcacat gccttatgaa tagaatctta actcctagag caactctg^g 
1681 gggtggggag gagcagatcc aagttttgcg gggtctaaag ctgtgtgtgt tgagggggat 
1741 actctgttta tgaaaaagaa taaaaaacac aaccacgaaa aaaaaaaaaa aaaaaaaaaa 
1801 aaaaaaaaaa aaaaaaaccc ccccccgccc cccccccctg cag 
Figure 12 B



1 agtgaatctg ggcgagtaac acaaaacttg agtgtcctta cctgaaaaat agaggttaga 
61 gggatgctat gtgccattgt gtgtgtgtgt tgggggtggg gattgggggt gatttgtgag 
121 caattggagg tgagggtgga gcccagtgcc cagcacctat gcactgggga cccaaaaagg 
181 agcatcttct catgatttta tgtatcagaa attgggatgg catgtcattg ggacagcgtc 
241 ttttttcttg tatggtggca cataaataca tgtgtcttat aattaatggt attttagatt 
301 tgacgaaata tggaatatta cctgttgtgc tgatcttggg caaactataa tatctctggg 
361 caaaaatgtc cccatctgaa aaacagggac aacgttcctc cctcagccag ccactatggg 
421 gctaaaatga gaccacatct gtcaagggtt ttgccctcac ctccctccct gctggatggc 
481 atccttggta ggcagaggtg ggcttcgggc agaacaagcc gtgctgagct aggaccagga 
541 gtgctagtgc cactgtttgt ctatggagag ggaggcctca gtgctgaggg ccaagcaaat 
601 atttgtggtt atggattaac tcgaactcca ggctgtcatg gcggcaggac ggcgaacttg 
661 cagtatctcc acgacccgcc cctgtgagtc cccctccagg caggtctatg aggggtgtgg 
721 agggagggct gcccccggga gaagagagct aggtggtgat gagggctgaa tcctccagcc 
781 agggtgctca acaagcctga gcttggggta aaaggacaca aggccctcca caggccaggc 
841 ctggcagcca cagtctcagg tccctttgcc atgcgcctcc ctctttccag gccaagggtc 
901 cccaggccca gggccattcc aacagacagt ttggagccca ggaccctcca ttctccccac 
961 cccacttcca cctttggggg tgtcggattt gaacaaatct cagaagcggc ctcagaggga 
1021 gtcggcaaga atggagagca gggtccggta gggtgtgcag aggccacgtg gcctatccac 
1081 tggggagggt tccttgatct ctggccacca gggctatctc tgtggccttt tggagcaacc 
1141 tggtggtttg gggcaggggt tgaatttcca ggcctaaaac cacacaggcc tggccttgag 
1201 tcctggctct gcgagtaatg catggatgta aacatggaga cccaggacct tgcctcagtc 
1261 ttccgagtct ggtgcctgca gtgtactgat ggtgtgagac cctactcctg gaggatgggg 
1321 gacagaatct gatcgatccc ctgggttggt gacttccctg tgcaatcaac ggagaccagc
1381 aagggttgga tttttaataa accacttaac tcctccgagt ctcagtttcc ccctctatga 
1441 aatggggttg acagcattaa taactacctc ttgggtggtt gtgagcctta actgaagtca 
1501 taatatctca tgtttactga gcatgagcta tgtgcaaagc ctgttttgag agctttatgt
1561 ggactaactc ctttaattct cacaacaccc tttaaggcac agatacacca cgttattcca
1621 tccattttac aaatgaggaa actgaggcat ggagcagtta agcatcttgc ccaacattgc 
1681 cctccagtaa gtgctggagc tggaatttgc accgtgcagt ctggcttcat ggcctgccct 
1741 gtgaatcctg taaaaattgt ttgaaagaca ccatgagtgt ccaatcaacg ttagctaata
1801 ttctcagccc agtcatcaga ccggcagagg cagccacccc actgtcccca gggaggacac
1861 aaacatcctg gcaccctctc cactgcattc tggagctgct ttctaggcag gcagtgtgag 
1921 ctcagcccca cgtagagcgg gcagccgagg ccttctgagg ctatgtctct agcgaacaag 
1981 gaccctcaat tccagcttcc gcctgacggc cagcacacag ggacagccct ttcattccgc 
2041 ttccacctgg gggtgcaggc agagcagcag cgggggtagc actgcccgga gctcagaagt 
2101 cctcctcaga caggtgccag tgcctccaga atgtggcagc tcacaagcct cctgctgttc 
2161 gtggccacct ggggaatttc cggcacacca gctcctcttg gtaaggccac cccaccccta 
2221 ccccgggacc cttgtggcct ctacaaggcc ctggtggcat ctgcccaggc cttcacagct 
2281 tccaccatct ctctgagccc tgggtgaggt gaggggcaga tgggaatggc aggaatcaac 
2341 tgacaagtcc caggtaggcc agctgccaga gtgccacaca ggggctgcca gggcaggcat
2401 gcgtgatggc agggagcccc gcgatgacct cctaaagctc cctcctccac acggggatgg 
2461 tcacagagtc ccctgggcct tccctctcca cccactcact ccctcaactg tgaagacccc 
2521 aggcccaggc taccgtccac actatccagc acagcctccc ctactcaaat gcacactggc 
2581 ctcatggctg ccctgcccca acccctttcc tggtctccac agccaacggg aggaggccat 
2641 gattcttggg gaggtccgca ggcacatggg cccctaaagc cacaccaggc tgttggtttc 
2701 atttgtgcct ttatagagct gtttatctgc ttgggacctg cacctccacc ctttcccaag 
2761 gtgccctcag ctcaggcata ccctcctcta ggatgccttt tcccccatcc cttcttgctc 
2821 acacccccaa cttgatctct ccctcctaac tgtgccctgc accaagacag acacttcaca 
2881 gagcccagga cacacctggg gacccttcct gggtgatagg tctgtctatc ctccaggtgt 
2941 ccctgcccaa ggggagaagc atggggaata cttggttggg ggaggaaagg aagactgggg 
3001 ggatgtgtca agatggggct gcatgtggtg tactggcaga agagtgagag gatttaactt
3061 ggcagccttt acagcagcag ccagggcttg agtacttatc tctgggccag gctgtattgg 
3121 atgttttaca tgacggtctc atccccatgt ttttggatga gtaaattgaa ccttagaaag 
3181 gtaaagacac tggctcaagg tcacacagag atcggggtgg ggttcacagg gaggcctgtc 
3241 catctcagag caaggcttcg tcctccaact gccatctgct tcctggggag gaaaagagca 
3301 gaggacccct gcgccaagcc atgacctaga attagaatga gtcttgaggg ggcggagaca 
3361 agaccttccc aggctctccc agctctgctt cctcagaccc cctcatggcc ccagcccctc 
3421 ttaggcccct caccaaggtg agctcccctc cctccaaaac cagactcagt gttctccagc 
3481 agcgagcgtg cccaccaggt gctgcggatc cgcaaacgtg ccaactcctt cctggaggag 
3541 ctccgtcaca gcagcctgga gcgggagtgc atagaggaga tctgtgactt cgaggaggcc 
Figure 12 B (continued)



3601 aaggaaattt tccaaaatgt ggatgacaca gtaaggccac catgggtcca gaggatgagg 
3661 ctcaggggcg agctggtaac cagcaggggc ctcgaggagc aggtggggac tcaatgctga 
3721 ggccctctta ggagttgtgg gggtggctga gtggagcgat taggatgctg gccctatgat 
3781 gtcggccagg cacatgtgac tgcaagaaac agaattcagg aagaagctcc aggaaagagt 
3841 gtggggtgac cctaggtggg gactcccaca gccacagtgt aggtggttca gtccaccctc 
3901 cagccactgc tgagcaccac tgcctccccg tcccacctca caaagagggg acctaaagac 
3961 caccctgctt ccacccatgc ctctgctgat cagggtgtgt gtgtgaccga aactcacttc 
4021 tgtccacata aaatcgctca ctctgtgcct cacatcaaag ggagaaaatc tgattgttca 
4081 gggggtcgga agacagggtc tgtgtcctat ttgtctaagg gtcagagtcc tttggagccc 
4141 ccagagtcct gtggacgtgg ccctaggtag tagggtgagc ttggtaacgg ggctggcttc 
4201 ctgagacaag gctcagaccc gctctgtccc tggggatcgc ttcagccacc aggacctgaa 
4261 aattgtgcac gcctgggccc ccttccaagg catccaggga tgctttccag tggaggcttt 
4321 cagggcagga gaccctctgg cctgcaccct ctcttgccct cagcctccac ctccttgact 
4381 ggacccccat ctggacctcc atccccacca cctctttccc cagtggcctc cctggcagac 
4441 accacagtga ctttctgcag gcacatatct gatcacatca agtccccacc gtgctcccac 
4501 ctcacccatg gtctctcagc cccagcagcc ttggctggcc tctctgatgg agcaggcatc 
4561 aggcacaggc cgtgggtctc aacgtgggct gggtggtcct ggaccagcag cagccgccgc 
4621 agcagcaacc ctggtacctg gttaggaacg cagaccctct gcccccatcc tcccaactct 
4681 gaaaaacact ggcttaggga aaggcgcgat gctcaggggt cccccaaagc ccgcaggcag 
4741 agggagtgat gggactggaa ggaggccgag tgacttggtg agggattcgg gtcccttgca 
4801 tgcagaggct gctgtgggag cggacagtcg cgagagcagc actgcagctg catggggaga 
4861 gggtgttgct ccagggacgt gggatggagg ctgggcgcgg gcgggtggcg ctggagggcg 
4921 ggggaggggc agggagcacc agctcctagc agccaacgac catcgggcgt cgatccctgt 
4981 ^tgtc^ggaa gccctcccct cccctgcccg ctcacccgct gccctgcccc acccgggcgc 
5041 gcccctccgc acaccggctg caggagcctg acgctgcccg ctctctccgc agctggcctt 
5101 ctggtccaag cacgtcggtg agtgcgttct agatccccgg ctggactacc ggcgcccgcg 
5161 cccctcggga tctctggccg ctgaccccct accccgcctt gtgtcgcaga cggtgaccag 
5221 tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca cggcacgtgc 
5281 atcgacggca tcggcagctt cagctgcgac tgccgcagcg gctgggaggg ccgcttctgc 
5341 cagcgcggtg agggggagag gtggatgctg gcgggcggcg gggcggggct ggggccgggt 
5401 tgggggcgcg gcaccagcac cagctgcccg cgccctcccc tgcccgcaga ggtgagcttc 
5461 ctcaattgct ctctggacaa cggcggctgc acgcattact gcctagagga ggtgggctgg 
5521 cggcgctgta gctgtgcgcc tggctacaag ctgggggacg acctcctgca gtgtcacccc 
5581 gcaggtgaga agcccccaat acatcgccca ggaatcacgc tgggtgcggg gtgggcaggc 
5641 ccctgacggg cgcggcgcgg ggggctcagg agggtttcta gggagggagc gaggaacaga 
5701 gttgagcctt ggggcagcgg cagacgcgcc caacaccggg gccactgtta gcgcaatcag 
5761 cccgggagct gggcgcgccc tccgctttcc ctgcttcctt tcttcctggc gtccccgctt 
5821 cctccgggcg cccctgcgac ctggggccac ctcctggagc gcaagcccag tggtggctcc 
5881 gctccccagt ctgagcgtat ctggggcgag gcgtgcagcg tcctcctcca tgtagcctgg 
5941 ctgcgttttt ctctgacgtt gtccggcgtg catcgcattt ccctctttac ccccttgctt 
6001 ccttgaggag agaacagaat cccgattctg ccttcttcta tattttcctt tttatgcatt 
6061 ttaatcaaat ttatatatgt atgaaacttt aaaaatcaga gttttacaac tcttacactt 
6121 tcagcatgct gttccttggc atgggtcctt ttttcattca ttttcataaa aggtggaccc 
6181 ttttaatgtg gaaattccta tcttctgcct ctagggcatt tatcacttat ttcttctaca 
6241 atctcccctt tacttcctct attttctctt tctggacctc ccattattca gacctctttc 
6301 ctctagtttt attgtctctt ctatttccca tctctttgac tttgtgtttt ctttcaggga 
6361 actttctttt ttttcttttt ttttgagatg gagtttcact cttgttgtcc caggctggag 
6421 tgcaatgacg tgatctcagc tcaccacaac ctccgcctcc tggattcaag cgattctcct 
6481 gccgcagcct cccgagtagc tgggattaca ggcatgcgcc accacgccca gctaattttg 
6541 tgtttttagt agagaagggg tttctccgtg ttggtcaagc tggtcttgaa ctcctgacct 
6601 caggtgatcc acctgccttg gcctcctaaa gtgctgggat tacaggcgtg agccaccgcg 
6661 cccagcctct ttcagggaac tttctacaac tttataattc aattcttctg cagaaaaaaa 
6721 tttttggcca ggctcagtag ctcagaccaa taattccagc actttgagag gctgaggtgg 
6781 gaggattgct tgagcttggg agtttgagac tagcctgggc aacacagtga gaccctgtct 
6841 ctatttttaa aaaaagtaaa aaaagatcta aaaatttaac tttttatttt gaaataatta 
6901 gatatttcca ggaagctgca aagaaatgcc tggtgggcct gttggctgtg ggtttcctgc 
6961 aaggccgtgg gaaggccctg tcattggcag aaccccagat cgtgagggct ttccttttag 
7021 gctgctttct aagaggactc ctccaagctc ttggaggatg gaagacgctc acccatggtg 
7081 ttcggcccct cagagcaggg tggggcaggg gagctggtgc ctgtgcaggc tgtggacatt 
7141 tgcatgactc cctgtggtca gctaagagca ccactccttc ctgaagcggg gcctgaagtc 
Figure 12 B (continued)

7201 cctagtcaga gcctctggtt caccttctgc aggcagggag aggggagtca agtcagtgag 
7261 gagggctttc gcagtttctc ttacaaactc tcaacatgcc ctcccacctg cactgccttc 
7321 ctggaagccc cacagcctcc tatggttccg tggtccagtc cttcagcttc tgggcgcccc 
7381 catcacgggc tgagattttt gctttccagt ctgccaagtc agttactgtg tccatccatc 
7441 tgctgtcagc ttctggaatt gttgctgttg tgccctttcc attcttttgt tatgatgcag 
7501 ctcccctgct gacgacgtcc cattgctctt ttaagtctag atatctggac tgggcattca 
7561 aggcccattt tgagcagagt cgggctgacc tttcagccct cagttctcca tggagtatgc 
7621 gctctcttct tggcagggag gcctcacaaa catgccatgc ctattgtagc agctctccaa 
7681 gaatgctcac ctccttctcc ctgtaattcc tttcctctgt gaggagctca gcagcatccc 
7741 attatgagac cttactaatc ccagggatca cccccaacag ccctggggta caatgagctt 
7801 ttaagaagtt taaccaccta tgtaaggaga cacaggcagt gggcgatgct gcctggcctg 
7861 actcttgcca ttgggtggta ctgtttgttg actgactgac tgactgactg gagggggttt 
7921 gtaatttgta tctcagggat tacccccaac agccctgggg tacaatgagc cttcaagaag 
7981 tttaacaacc tatgtaagga cacacagcca gtgggtgatg ctgcctggtc tgactcttgc 
8041 cattcagtgg cactgtttgt tgactgactg actgactgac tggctgactg gagggggttc 
8101 atagctaata ttaatggagt ggtctaagta tcattggttc cttgaaccct gcactgtggc 
8161 aaagtggccc acaggctgga ggaggaccaa gacaggaggg cagtctcggg aggagtgcct 
8221 ggcaggcccc tcaccacctc tgcctacctc agtgaagttc ccttgtggga ggccctggaa 
8281 gcggatggag aagaagcgca gtcacctgaa acgagacaca gaagaccaag aagaccaagt 
8341 agatccgcgg ctcattgatg ggaagatgac caggcgggga gacagcccct ggcaggtggg 
8401 aggcgaggca gcaccggctc gtcacgtgct gggtccggga tcactgagtc catcctggca 
8461 gctatgctca gggtgcagaa accgagaggg aagcgctgcc attgcgtttg ggggatgatg 
8521 aaggtggggg atgcttcagg gaaagatgga cgcaacctga ggggagagga gcagccaggg 
8581 tgggtgaggg gaggggcatg ggggcatgga ggggtctgca ggagggaggg ttacagtttc 
8641 taaaaagagc tggaaagaca ctgctctgct ggcgggattt taggcagaag ccctgctgat 
8701 gggagagggc taggagggag ggccgggcct gagtacccct ccagcctcca catgggaact 
8761 gacacttact gggttcccct ctctgccagg catgggggag ataggaacca acaagtggga 
8821 gtatttgccc tggggactca gactctgcaa gggtcaggac cccaaagacc cggcagccca 
8881 gtgggaccac agccaggacg gcccttcaag ataggggctg agggaggcca aggggaacat 
8941 ccaggcagcc tgggggccac aaagtcttcc tggaagacac aaggcctgcc aagcctctaa 
9001 ggatgagagg agctcgctgg gcgatgttgg tgtggctgag ggtgactgaa acagtatgaa 
9061 cagtgcagga acagcatggg caaaggcagg aagacaccct gggacaggct gacactgtaa 
9121 aatgggcaaa aatagaaaac gccagaaagg cctaagccta tgcccatatg accagggaac 
9181 ccaggaaagt gcatatgaaa cccaggtgcc ctggactgga ggctgtcagg aggcagccct 
9241 gtgatgtcat catcccaccc cattccaggt ggtcctgctg gactcaaaga agaagctggc 
9301 ctgcggggca gtgctcatcc acccctcctg ggtgctgaca gcggcccact gcatggatga 
9361 gtccaagaag ctccttgtca ggcttggtat gggctggagc caggcagaag ggggctgcca 
9421 gaggcctggg tagggggacc aggcaggctg ttcaggtttg ggggaccccg ctccccaggt 
9481 gcttaagcaa gaggcttctt gagctccaca gaaggtgttt ggggggaaga ggcctatgtg 
9541 cccccaccct gcccacccat gtacacccag tattttgcag tagggggttc tctggtgccc 
9601 tcttcgaatc tgggcacagg tacctgcaca cacatgtttg tgaggggcta cacagacctt 
9661 cacctctcca ctcccactca tgaggagcag gctgtgtggg cctcagcacc cttgggtgca 
9721 gagaccagca aggcctggcc tcagggctgt gcctcccaca gactgacagg gatggagctg 
9781 tacagaggga gccctagcat ctgccaaagc cacaagctgc ttccctagca ggctgggggc 
9841 tcctatgcat tggccccgat ctatggcaat ttctggaggg ggggtctggc tcaactcttt 
9901 atgccaaaaa gaaggcaaag catattgaga aaggccaaat tcacatttcc tacagcataa 
9961 tctatgccag tggccccgtg gggcttggct tagaattccc aggtgctctt cccagggaac 
10021 catcagtctg gactgagagg accttctctc tcaggtggga cccggccctg tcctccctgg
10081 cagtgccgtg ttctgggggt cctcctctct gggtctcact gcccctgggg tctctccagc 
10141 tacctttgct ccatgttcct ttgtggctct ggtctgtgtc tggggtttcc aggggtctcg 
10201 ggcttccctg ctgcccattc cttctctggt ctcacggctc cgtgactcct gaaaaccaac 
10261 cagcatccta cccctttgga ttgacacctg ttggccactc cttctggcag gaaaagtcac 
10321 cgttgatagg gttccacggc atagacaggt ggctccgcgc cagtgcctgg gacgtgtggg 
10381 tgcacagtct ccgggtgaac cttcttcagg ccctctccca ggcctgcagg ggcacagcag 
10441 tgggtgggcc tcaggaaagt gccactgggg agaggctccc cgcagcccac tctgactgtg 
10501 ccctctgccc tgcaggagag tatgacctgc ggcgctggga gaagtgggag ctggacctgg 
10561 acatcaagga ggtcttcgtc caccccaact acagcaagag caccaccgac aatgacatcg 
10621 cactgctgca cctggcccag cccgccaccc tctcgcagac catagtgccc atctgcctcc 
10681 cggacagcgg ccttgcagag cgcgagctca atcaggccgg ccaggagacc ctcgtgacgg 
10741 gctggggcta ccacagcagc cgagagaagg aggccaagag aaaccgcacc ttcgtcctca 
10801 acttcatcaa gattcccgtg gtcccgcaca atgagtgcag cgaggtcatg agcaacatgg 
Figure 12 B (continued)

10861 tgtctgagaa catgctgtgt gcgggcatcc tcggggaccg gcaggatgcc tgcgagggcg 
10921 acagtggggg gcccatggtc gcctccttcc acggcacctg gttcctggtg ggcctggtga 
10981 gctggggtga gggctgtggg ctccttcaca actacggcgt ttacaccaaa gtcagccgct
11041 acctcgactg gatccatggg cacatcagag acaaggaagc cccccagaag agctgggcac 
11101 cttagcgacc ctccctgcag ggctgggctt ttgcatggca atggatggga cattaaaggg 
11161 acatgtaaca agcacaccgg cctgctgttc tgtccttcca tccctctttt gggctcttct 
11221 ggagggaagt aacatttact gagcacctgt tgtatgtcac atgccttatg aatagaatct
11281 taactcctag agcaactctg tggggtgggg aggagcagat ccaagttttg cggggtctaa 
11341 agctgtgtgt gttgaggggg atactctgtt tatgaaaaag aataaaaaac acaaccacga 
11401 agccactaga gccttttcca gggctttggg aagagcctgt gcaagccggg gatgctgaag 
11461 gtgaggcttg accagctttc cagctagccc agctatgagg tagacatgtt tagctcatat 
11521 cacagaggag gaaactgagg ggtctgaaag gtttacatgg tggagccagg attcaaatct 
11581 aggtctgact ccaaaaccca ggtgcttttt tctgttctcc actgtcctgg aggacagctg 
11641 tttcgacggt gctcagtgtg gaggccacta ttagctctgt agggaagcag ccagagaccc 
11701 agaaagtgtt ggttcagccc agaat 
Figure 13 (A)



SEQ ID NO:3 



ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 



ggcctctg actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 



SEQ ID NO: 76 
Figure 13 (B)



SEQ ID NO:77 



ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt
tctccactat ggcaactaac tcactcaaat ttccctcctt agcagcattc catcttcccg
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct
tctatcttga atctt 



ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggacg atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 



SEQ ID NO: 78 
Figure 13 (C)



SEQ ID NO:79 



ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa
gcaagaagtt gaagttgcct agaccagagg acataagtat gtactctcct ttaactagca
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 



ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atttcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 
Figure 13 (D)



SEQ ID NO:81 

ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa
ctgctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 



SEQ ID NO: 82 

ggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttatcct 
tctatcttga atctt 
Figure 13 (E)



SEQ ID NO:83 



gcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 
tctatcttga atctt 
Figure 14



GAATTCTOtA ASCATTTCCT ATGtOTACCT GCCCCTGGGC AAGGTGGGCC TGACTTGnA -1403 
GMStGTTlUSJ^ GTTTTACCCT CTTCXTCTWS GACGGCCTGG TACCACCACA GCCCAGCATC -1343 
GTGTGGTGCC TCAGCAGGAG CCATCTGCTT ACAATCAACA CAASCTGTTC CAGCCAATTT -1283
AAAGAAACTT CAGGAGGAAT ACGCTTTTAG GAGGGCATGG GGACeCTCCT GCACCCGAAG -1223
CCAGGASGTG CCACCAAICA XAAGGAGGCA GGGGCCTCCT TCCGCTGCTC OCTGGGACTC -1163
TCTAGGTGTC CCTGCCCTCA CCCCCCCTCT GCACACCTGC ATCTTCCTTC TCATCMCTT -1103
CCTCTGCTTT AAGCGTAAAC ATGGATGCCC AGGACCTGOC CTCAAafCTTC CGAOTCTGeT -1043
ACTTATGGTG TACTGACASt GTGACSACCCT ACtCCTCTGA TCAATCCCCT GGCTTGGTGA -983
CTTCCCTGTC CAATCAATGG AAGCCAGCGA CGCAGGGTCA CATCCCCCtST TTAGAGCTCC -923 
AGACTTGGAG AAGGAACGTG GGCAAGTCTT CCCAGGAACA GGTAGGGCAC GGAGSAAASG -863 
GGGOCATCTC TCGTGCAGCC CGGTTCGGAG CAGGAAGACG CTTRATAAAT GCTGATAGAC -803
VGCAGGACAC ACGCAAAGGT GCTGAGCTCG ACCCTTTATT TCTCCCCTTC TCCCTTCTGC -743
CACCCCGGCC AGGAAATTGC TGCAGCCTTT CTGGAATCCC CTTCATTTTT CTTACTGCTC -683
CACAAAAGGG GCCAAATGGA AGCAGCAACA CCTGAGTTCA AATTAAATCT GCCAACTACC -623
AGCTCAGTGA ATCTGGGCGA GTAACACAAA ACTTGAGTGT CCTTACCTGA AAAATAGAGG -563
TTAGAGGGAT GCTATGTGCC ATTGTGTGTG TGTGTTGGGG GTGGGGATTG CGGGTGATTT -503
GTGAGCAATT GGAGGTGAGG GTGGAGCCCA GTCCCCAGCA CCTATGCACT GGGGACCCAA -443
AAAGGAGCAT CTTCTCATGA TTTTATCTAT CACAAATTGG GATCCCATGT CATTGGGACA -383
GCGTCTTTTT TCTTGTATGG TGGCACATAA ATACATCTGT CTTATAATTA ATGGTATTTT -323
AGATTTGACG AAATATGGAA TATTACCTGT TGTGCTGATC TTGGGCAAAC TATAATATCT -263
CTGGGCAAAA ATGTCCCCAT CTGAAAAACA GGGACAACGT TCCTCCCTCA GCCAGCGACT -203
ATGGGGCTAA AATGAGACCA CATCTCTCAA CCGTTTTGCC CTCACCTCCC TCCCTGCTGG -143
ATGGCATCCT TGGTAGGCAG AGGTGGGCTT CGGGCAGAAC AACCCGTGCT GAGCTAGGAC -83
CAGGAGTGCT AGTGCCACTG TTTGTCTATC GAGAGGGAGG CCTCAGTGCT GAGGGCCAAG -23
CAAATATTTG TGGTTATGGA TTAACTCCAA CTCCAGGCTG TCATGGCGCC AGGACGGCGA +38
ACTTGCAGTC TCTCCACGAC CCCCCCCTGT GAGTCCCCCT CCAGGCAGGT CTATGAGGGG +98

TGTGGAGGGA CGCCTGCCCC CGCGAGAAGA

1350 bp

MET TRP GLN LEU -39

AAGAAGTCCT CCTCAGACAG GTGCCAGTGC CTCCACAATG TGG CAG CTC +1527
Figure 15
Figure 16
Figure 17A
SEQUENCE LISTING

<110> Kurachi, Kotoku
Kurachi, Sumiko

<120> Nucleotide Sequences for Gene Regulation and Methods of 
Use Thereof 

<130> UM-04305 

<140> XX 

<141> 2000-06-06 

<150> 09/328,925 
<151> 1999-06-09

<160> 208

<170> PatentIn Ver. 2.0

<210> 1
<211> 32
<212> DNA
<213> Homo sapiens

<400> 1
agccattcag tcgaggaagg atagggtggt at 32

<210> 2
<211> 7
<212> DNA
<213> Homo sapiens

<400> 2
gaggaag

<210> 3
<211> 1273
<212> DNA
<213> Homo sapiens



WO 00/75279

<400> 3
ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60
gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120
gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180
cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240
tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300
cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360
tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420
tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480
ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540
ttttaccctc catggtcgtt aaaggagaga tggggagcat cattctgtta tacttctgta 600
cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660
aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720
aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780
tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840
agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900
agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960
ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020
ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080
aagaagttga agttgcctag accagaggac ataagtatca tgtctccttt aactagcata 1140
ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200
aagttgtcct tttctggttt cgtgttcacc atggaacatt ttgattatag ttaatccttc 1260
tatcttgaat ctt 1273

<210> 4
<211> 38059
<212> DNA
<213> Homo sapiens

30 <400> 4 

gtatatctag aaaaccccat tgtctcattc caaaatcacc ttaagatgga taggcaactt 60
cagcaaagtc tcaggataac aaaatcaatg tgcaaaaatc acaggcattc ttatacacca 120
atagcagaca aacagacagc caaatcatga gtgaactccc attcacaatt gcttcaaaga 180
gaataaaata cqtaggaatc ctacttacaa gggatgtgaa ggacctcttc aaggagaact 240
acaaaccact gctcaatgaa ataaaagagg atacaaacaa atggaagaac attacatgct 300
catgggtagg aagaatcaat atcatgaaaa tggccataat gcccaaggta atttatagat 360
tcaatgccat ccccatcaag ctaccaatga ctttcttcac agaattggaa aaaactactt 420
taaagttcat atggaaccaa aaaagagccc gcatcgccaa gtcaatccta agccaaaaga 480
acaaagctgg aggcatcatg ctacctgact tcaaactata ctacaaggct acagtaacca 540
aaacagcatg gtactggtac caaaacagag atacagacca atggaacaga acagagccct 600
cagaaataat gccacatatc tacaactatc tgatctttga aaaacctgac aaaaacaaga 660
aatggggaaa ggaatcccta attaataaat ggtgctggga aaactggcta gccatatgta 720
gaaagctgaa actggatccc ttccttatac cttatacaaa aattaattca agatggatta 780
aagacttcat tgttagacct aaaaccataa aaaccctaga agaaaaccta ggcaatacca 840
ttcaggacat aggcatgggc ttggacttca tgtctaaaac accaaaagca atggcaacaa 900



aagccaaaat tgacaaatgg gatcaaatga aactaaagag ctcctgcaca gcaaaagaaa 960

ctaccatcag agtgaacagg caacctaaag aatgggagaa aatttttgca atctactcat 1020 

ctgtccaagg gctaatatct agaatctaaa atgaactcaa acaaatttac agaaaaaaac 1080 

aaacaacccc atcaacaagt gggtgaagga tatgaacaga cacttctcaa aagaagacat 1140 

ttatgcagcc aacagacaca tgaaaaaatg ctcagcatca ccggccatca gagaaatgca 1200

aatcaaaacc acaatgagat accatctcac acaagttaga atggcgatca tcaaaaactc 1260

aggaagcaac aggtgctgga gaggatgtgg agaaatagga acactttgac actgttggtg 1320 

ggactgtaaa ctagttcaac cattgtggaa gtcagtgtgg cgattcctca gggatctaga 1380 

cctagaaata ccatctgacc cagccatccc attattgggt atataccaaa gtattataaa 1440 

tcacgctgct ataaagacac atgcacacgt atgtttattg cggcactttt cacaatagca 1500

atgacttgga accaacccaa atgtccaaca atgatagact ggattaagaa aatgtggcac 1560 

atatacacct aggaatacta ggcagccata aaaagaaaat gagttcatgt cctttgtagg 1620 

gcatggatga agctagaaac catcattctc agcaaactat cgcaaggaca aaaaaccaaa 1680 

caccgcatgt tctcactcat aggtgggaac tgaacaatga gaacacttgg acacaggaag 1740 

gggaacatca cacaccgggg cctgttgtgg ggtggggggc gaggggaggg atagcattag 1800

gggatatacc taatgctaaa tgacgagtta atgggtacag cacaccaaca tggcacatgt 1860

atacatatgt aacaaacctg ctcgttgtgc acatgtaccc taaaacttaa agtataataa 1920 

taaaaaaaag atcattctaa aatttataca agcccttaga acagttaaaa atatcttacc 1980 

aaaagaagaa taaagttgga ggaatcactc tacctaatat aaagtcttac tacatagcta 2040 

cagtaattat gacagtgtta tattggcaga gggataaata catcaatggc acaaagaata 2100

gatagagaaa ctggaagtag acccaaaaca atatggttaa ctgacttacg aaaaaatttc 2160 

agaagccatt cagtcgagga aggatagggt ggtattgttg ttttttgttt taacaaattg 2220 

tgctggataa attggacata cctatggaaa aaaaaatgaa gtttgaccta aacatcatac 2280

tttacacaaa tattaactca aaatggagca tgggcataaa tctaaaactt caaactgtaa 2340 

aacatttaga aaaaaatagg aaaaaaacta tcaggatcta gtgttagtgg aagagttcta 2400

aatgtgatcc ataaaacaaa aacaaataaa ctggactaca tcaaaactaa aaaattctac 2460 

tctgtgaaag acctaattaa gaggacaaaa gacaagctac aggctggaga caatatattt 2520 

aatccacgta tctatgaaag gattcatatc tagaatatat aaacaacctt aagaatctga 2580 

cagtaaaaaa aaaaaatcag actaactgga ccactcatac attgctgatg gaaatgtaaa 2640 

gtggtacagc cattttggta aacatcattg ctctctgaca aagatacggt gggtcccact 2700

gatgaactgt gctgccacag taaatgtagc cactatgcct atctccattc tgaagatgtg 2760 

tcacttcctg tttcagactc aaatcagcca cagtggcaga agcccacgaa atcagaggtg 2820

aaatttaata atgaccactg cccattctct tcacttgtcc caagaggcca ttggaaatag 2880

tccaaagacc cattgaggga gatggacatt atttcccaga agtaaataca gctcagcttg 2940

tactttggta caactaatcg accttaccac tttcacaatc tgctagcaaa ggttatgcag 3000

cgcgtgaaca tgatcatggc agaatcacca ggcctcatca ccatctgcct tttaggatat 3060

ctactcagtg ctgaatgtac aggtttgttt ccttttttaa aatacattga gtatgcttgc 3120 

cttttagata tagaaatatc tgatgctgtc ttcttcacta aattttgatt acatgatttg 3180 

acagcaatat tgaagagtct aacagccagc acgcaggttg gtaagtactg gttctttgtt 3240 

agctaggttt tcttcttctt catttttaaa actaaataga tcgacaatgc ttatgatgca 3300

tttatgttta ataaacactg ttcagttcat gatttggtca tgtaattcct gttagaaaac 3360 

attcatctcc ttggtttaaa aaaattaaaa gtgggaaaac aaagaaatag cagaatatag 3420

tgaaaaaaaa taaccacatt atttttgttt ggacttacca ctttgaaatc aaaatgggaa 3480

acaaaagcac aaacaatggc cttatttaca caaaaagtct gattttaaga tatatgacat 3540

ttcaaggttt cagaagtacg taatgaggtg tgtctctaat tttttaaatt atatatcttc 3600
aatttaaagt tttagttaaa acataaagat tiaacctttca ttagcaagct gttagttatc 3660
accaaagcct ttcatggatt aggaaaaaat cattttgtct ctatgtcaaa catcttggag 3720
tttgattatt tggggaaaca caatactcag ttgagttccc taggggagaa aagcaagctt 3780
aagaattgac ataaagagta ggaagttagc taatgcaaca tatatcactt tgttttttca 3840
caactacagt gactttatgt atttcccaga ggaaggcata cagggaagaa attatcccat 3900
ttggacaaac agcatgttct cacaggaagc atttatcaca cttacttgtc aactttctag 3960
aatcaaatct agtagctgac agtaccagga tcaggggtgc caaccctaag cacccccaga 4020
aagctgactg gccctgtggt tcccactcca gacatgatgt cagctgtgaa atcagactga 4080
aatgctgaaa taacgataaa aaaaaataca gaggttaaac tagcaaagtg agtaaagtca 4140
agggataaag aaaatttgtt ggaaaactca caaagcagga cataaagcaa ggccattaga 4200
tatatctcat tagtgtgaca tctg^gagga caaagcatcc aaaccctttc ttctatataa 4260
9tggtgagat gatgaaggtt gtaagaggct tctgccccct tgaagacttc agatgctggg 4320
gaaaggatag ataagaataa ggatgaacct ggcttttgga gcctgggaaa taatgactag 4380
cgataaacct gaagggaagt taagtatacg atccccagat aatactaagg agaaaggcaa 4440
tgtgattctg cagccattgt agccagagat aataagccct tgaggaaggg gccaggggaa 4500
tttttctaag gatagacagt attaatgcag cactctcttc tgctattaaa ctctcattgg 4560
cttctaaaag gagtttcggt gagtgatttg ctgagatgtt tgcattttca tgctgctgcc 4620
tttaggttat tattgcaaca gtttggaatt ttgaaattaa aacagttctg taaaaccagt 4680
ttagttttgt aaagtgtatg catcaaagat gtccttcatt cagacattac tgagttacaa 4740
ctacggtgcc aggtactgtg tcagggtact aggggtatgg ggataaacca gactccctct 4800
ttgatctaaa gcagcatgag gccaggtgag aggtttcaat atatgtgata aaatgtgcac 4860
taggtactaa gggatcatag agaaaggaac acattaaatg gggaaacaat tgatagagag 4920
agaatatttt catctgggtc ttaaaagatg agtaggcgtt ctctctcttt aaatgtctga 4980
tataagggca ttttatgcaa agaaggatca ctcgtgcaaa gactcagctt tgcaagaacg 5040
tgaggtattt caggagtttt gtatggttcc atatggacta tgacaagtga gacaggtaaa 5100
ctaggcagag ctggtcatca gataatgaag tcattaacct aaggagattg gacaataaaa 5160
tgcaatatgg aggtatcgaa gtataaacat aaggagtacc actgatggct gatttaggat 5220
gcccagtctg gcaacacgct aatgaaatga tagtggggga gggggccgta ccaagactag 5280
gagagagcag tcctgagact attgcaatta Cctgcgggag acataaaggc tagaacctga 5340
acagtagcag tacaaaaaaa gaggggagtt caaatgatat taaggaagta gaagtggtat 5400
gacttaacca tctgggtatg gaaggggaaa tggctagagt cttggggact ttgtgtttga 5460
tgtgattatg gaccacagaa taatgtctaa gagaactggc tctttagtct gactgccaga 5520
gtctgaatcc tgaatgtttt agtatgttac cttgcaaagc ccttagcctc tatgaatcta 5580
tcttcctcat ttataaaaat aagatgacag tgcctatctc gtgggacttt tgtgaggatg 5640
aagtgagata atggatgcaa agttactgag cacagtgtcc aacacagcag aagcattaca 5700
tatacattag ctattactgg ctacattatg atatacagtt agggagttgg aaagataatc 5760
tgaaattcag gagacgtatc tgactatagg tgagtatttg gaactcattg ttctgtaaac 5820
agtagttaca gcacgtgtgt gggcatctgg agagtgagca tggatattgt gatacctagt 5880
acagtgcctg gcagtagtgg ttgtatgctc agtaaatttt gttgacaggg tcagggccgg 5940
actagactgt ggtaagcaag gcctgtaggg cataaatata cttgtatgcc ccgagaagtg 6000
aggacctctt aaatattgtg ccctacatgc cttgtttggt tcactcttgt cccagcccta 6060
gcaagtatat ataaggtgaa aaaggaaaaa gctgaggctg gagcctggga gaaccctgga 6120
catttaaggg ccatggagag gaacaggagt taatcaattc aagtgctgga tggataacag 6180
gagttagagc aaagcgggga accagaatag agtgattatt ataaaaagag tttcctaaaa 6240
agggagagat caacaattag aaattattta gagcagccag taaatacata aactcaaaat 6300
tattctttag gtcattcctg attgtgacaa tagtcatttc attatataaa tgtgattaag 6360
gaaggaaaag agctacacag aagttattaa agagctaaag agaattgaga aatttaaaac 6420
agaagaaagt agggccaaca tgaaaggagt agggagaaaa agagataacc agcatattgt 6480
ctcactgatc ctgccaacac ctgtgagata gatattctta ttatactact gttaaaccta 6540
atttacattt gacaaagtta aggttcagag cttgtgtgac ttgtccaagg tcacaggtct 6600
agaggaggca gatacttgat tcaaacctat ttctgtctga tctgattcta aagtctgttt 6660
tttcactcaa ccacactgta cagtcagctc tccttgtgag ttccacagcc acagattcaa 6720
ttaactgcag atcaaaaata ttcaagaaaa aaatggatgg ttcgatctct actgaacatg 6780
tacagactct tttatctttc attattccct aaacaataca gcataacaac tatttacata 6840
gcatttacat tgtattagct attaagagaa acctagagat gatttaaagt acaaaggagg 6900
atgtgtttag gttatatgca aatagtaagc catttttata tcggagactt gagcatccac 6960
agatcttgat atttgcaggg ggtcttgcca ccaattttcc atggatactg aggaacgact 7020
gtaaatggat gcaggcatgg atgctattta ggagtgtcca gggccaagta aatgagttgc 7080
tgagcagaga ggtgggtgga ggctgtgagg catcaatatg tggtggcatc catctgcatt 7140
ttggtgattt ttttccttca cagtcctcgg ctgtctggga agagaaggat gaaggcagat 7200
ggctgctcca atttaggggc taggattgca gggtgggcac agcattgcaa acgagtgaag 7260
gaaattgaga aatatggcca atgaagagtt gaagagaggc ctggcatggt ggctcacacc 7320
tataatccca gcactttcag aggcccaggc aggcagatca cttgaggtca ggagttcgac 7380
accagcctgg ccaacaaggt gaaatggtga aaccccggct ttactaaaaa tacaaaaatt 7440
agctgggcat ggtggcgggt gcctgtaatc ccagctactt gggaggctga ggcaggagaa 7500
tagcttgaac ctgggagatg gaggttgcag tgagctgaga tcgcaccact gcactccagc 7560
ctgggcgaca gagcaagact ctgtcaaaaa aaaaagagtt gaagagaaaa agtctaggct 7620
aaattcaaag aaaaaaagtg agcccaaaag gaacttgcag agcaagggaa aagcagggat 7680
gtcaagggac tagaacactc cataaagtga acagctgcaa tgaaaataag ggaagaaagt 7740
ttagttcatc tccgtttctt tcctttcctt tttactttcc tttctcttcc tttttggagt 7800
tagtcaggaa gtagtcccaa ataccccaga aagttcatct tataagccct tggtcctctt 7860
gagatggtat cagatatatt gctagaccct tgaagaaagg aacaactcca ggcaacttct 7920
tgagtccctg ttattaattt tatacataca cacacatata tgtatataca tgaaaacaca 7980
caaacacatg tgtgtgtata cagccatgca ttccttaaca atggggatat attctgagaa 8040
atgtgtcatt aagcaatttc atcattgtgc gaacataata gagtgtactt acctaaacct 8100
aaatggtata gcttactaca tacctaggtt gtattgatgt ggcctattgc tcctaggctc 8160
ctgggctgca aacctgtaca gcatgtgact gtactgaaca ctgtaggcaa tggtaacagt 8220
ggtatttgtg tatctaaaca tagaaaaggt acagtgaaaa tacagtatta taaccttatg 8280
ggaccactgt cgtataatgt ggtccatcat tgaccaaaat gtcattgtgc agcaaatgat 8340
tatctcatat atatatatat atgatatgat atatatgata tatatgtgtg tgtatatatg 8400
tatacatata tatgtgtaca tatatgtata catatataca cacacatata tatgtacaca 8460
catatatgta tatatatgta cacacatata tgtatatata tgtacacaca tatatgtata 8520
tatatatgta cacacatata tgtatatata tatgtacaca cacacacata gagagagaga 8580
gagaggagag gagaggaagg agggagggaa ggagaaatat gattcagata gagacatcta 8640
tcctccagag ttcaggagtg tctcttcaga ctaggtagat gtagcttaaa aaaaacatat 8700
cctggaattc tagagagatg cttaaatcac tgcaattcct ataacacttg ccaaccaaag 8760
gtgctgttga tctgaaattg cttttttaaa ttaatgcagt gatttttctt taacatctag 8820
tgacagacac tggggtcaca tttgcagctg gaccataatt aggcttctgt tcttcaggag 8880
acatttgttc aaagtcattt gggcaaccat attctgaaaa cagcccagcc agggtgatgg 8940
atcactttgc aaagatcctc aatgagctat tttcaagtga tgacaaagtg tgaagttaac 9000
cgctcaattt gagttctttc tttttcatcc aaagtaaatt caaatatgat tagaaatctg 9060
accttttatt actggaattc tcttgactaa aagtaaaatt gaattttaat tcctaaatct 9120 
ccatgtgtat acagtactgt gggaacatca cagattttgg ctccatgccc taaagagaaa 9180 
ttggctttca gattatttgg attaaaaaca aagactttct taagagatgt aaaattttca 9240 
tgatgttttc ttttttgcta aaactaaaga attattcttt tacatttcag tttttcttga 9300
tcatgaaaac gccaacaaaa ttctgaatcg gccaaagagg tataattcag gtaaattgga 9360
agagtttgtt caagggaacc ttgagagaga atgtatggaa gaaaagtgta gttttgaaga 9420 
agcacgagaa gtttttgaaa acactgaaag aacagtgagt atttccacat aatacccttc 9480 
agatgcagag catagaatag aaaatcttta aaaagacact tctctttaaa attttaaagc 9540 

atccatatat atttatgtat gttaaatgtt ataaaagata ggaaatcaat accaaaacac 9600
tttagatatt accgttaatt tgtcttcttt tattctttat agactgaatt ttggaagcag 9660 
tatgttggta agcaattcat tttatcctct agctaatata tgaaacatat gagaattatg 9720 
tgggtttttt ctctgcataa atagataata tattaaactt tgtcaaaagg actcagaaag 9780 
atcagtccaa ccctctaacc catattggat ggtgatatac tacagggtta tgccagtgtg 9840 

ggaactatcg ctggtaaata agtttaatcc tccctagggc ttcacaaaga acattgttcc 9900
accccaggag ggtggaagga agaaactgaa atgattgtgt cttagaacct aatgaaagtt 9960 
tgcattcctc agtaaaatca gagactgctg attgacttaa atgtttatag cttcaaagtc 10020 
ctcctcatta tcatggccca gaagcccttc catgattgtc cttccccacc ctccccatta 10080 
cccttcttgc ctcctctgct acttctctcc tcgcacactg ggctccagcc accctggcct 10140 

tcctgtcact tcttgcacac tctaggaatg ctcccacttt ggaggcttta tctggctgtt 10200
tctcttattt ggctgttccc aacttcctgt gggctgactc cctcacctcc ttcgggtctt 10260 
tgcccaaatg ttaccatctt aatgaggcct accttcacca tctattaata cttcaacctg 10320 
ccccagtagc cttaccactc tagacacctg tacagaactc cactctactt tttaacagag 10380 
cttttcacca tctaatgtat catataattt cttactcata ctatttatca tttattttct 10440 

cctactccac taaaatgcaa gtttcatgtt ggcagggata ttcaattgtt ttgtttattg 10500
atatattcct agcacctaga acagtatctg gaaaagaggt actcagtaaa tatttatcaa 10560 
atgaattgac caaaagaagg aaaactcaaa actttaatga caactaactt taaagctaca 10620 
ataacttaaa attcagagta ggattttgag ggagggtaag tttcaaagat tgacttacct 10680 
aagactatct gcataaataa aaagaaatta atccagacaa caaattcacc aacttccatc 10740

aattggaaat ccaattctat tttctacagt ttatgttctg gagacactac tggacactct 10800
tttactctca taactcataa ctcctccact tttgttttta aatcatgaga gaaaaagagt 10860 
tgactctgtt atattgtttt atctaccttt ccttgatctt agaaacgaat actaccatac 10920 
cagcttctac tgaggtgccc cctaaagtta gtccaaatag gtctttgcaa tctccattcc 10980 
cgcagaattt agaactttga atcacatgat ttatttctaa aagtaaatcc atgccgattt 11040 

tccccaccaa aaaattcctg actattaaac tcctacaatc ccttcattgc tcactcccca 11100
cccccaggat catattttaa agttgggccc ttgccttttg ggtcacatag gtacactgtt 11160 
tgctatacca caggtatagc tatctggaaa acatggaggg tattattctg ttactactgc 11220 
ttcgtcaacc aaaaaataaa acaaaacaag aacaaaaaag aaacaaactc cctgcctctt 11280 
ttcacttgca gtcaaggttc ctaaccacta caaaattagc ctatgtttct tcttgcacat 11340 

agtagaaacc caagcttctc actgctgtgc tattctgtac catcaactca tcacataaag 11400

agcctggttg aagaatgatt gtccaaccac attactagca tctgtcaaga ctttccagtt 11460 
tacaaaaggc ctatcacatt taaccctcac accatccttg tgaccaaagc attattaact 11520 
ccattttaca ggagagtaaa ctgaagctta gggaagttaa aagaactgcc aaaggtctcc 11580 
cagttgggga gtcatgaagc ccagaagaga agccaaattc tctgctgctc aaccccttgc 11640 

tttcactatt acacctcagg gccttcaaat ctaaatgcag ttattcatta aacaggaacc 11700
tggtagtctt aaacaggaat ctctcacttg gtaagatctt gtctcttgtt gtatttgacc 11760
ccaactgtct atggctttgc ctgaacccaa agtacacaca gcctagaaac caaaggagaa 11820
ccaaatgtgg gataaaatga cactcatttt aacgacatgt ctcagcaaat gagttcctgt 11880
gtagctggct gaaagcccag accctttcag taaaacatcc tgaataattc acatttgttg 11940
gtctataata taaagggcaa atgtagctca tttttagacc agttctgaac atcaatagta 12000
acaaaccaga gataaccgat tttgttttca tagaattgga acaaattaga gtatctgtgc 12060
aaaagcatat cagatctagg agcagagggg acaaggtcta atttttaaat aagcaaattt 12120
tccagggagg gactacttat gataaaggga tattagtctc ttagtcaacg gaacctggat 12180
acacgcttct gacagagaag agggagaata ggcaggaatc tacacaccag atgtcaagga 12240
gatttgcttt aaaatacgac tgataattag aaatttctca gtttccccct tttccctcat 12300
tctttgattc ttattgttat ctttatctct tactcctttg tttctcatat attgagtctt 12360
acagatcaag ctcccatttt tttcttcagg ggtatttttc tagttcaaag tgcctaccat 12420
ctcccttctg gttctattca tccttctctc ccaaagctcc tttagaagtg tggattaagg 12480
cagagcacta agaaaccaga cttaaagatt cccttctcat tctgactttt ctcctttcac 12540
ctattccttc ctcctgtttt cttaccatca gtgtcttcaa aggctttcaa gtacacggta 12600
aatgcagaaa cttcaagaaa ggcagaatgg aaacataacc aatgcataca taaataaagc 12660
acactgtaga atctttttaa attctgtatg atatatcgaa tgctgtctct cacattacct 12720
agaccatttg aaaccgaatt tgtaaaacat agactatctt taagtagtaa cagatgcttc 12780
tgacatgttt tctattgtct tgaaccatta ctgcatatga tacatcaaag ttaagtgaca 12840
atacaagaaa gcagattcat ttgctccctg cctaggccgt cagttcctaa agtggaaacg 12900
ccatatatta tctagctcag tttgctctac aagacctgca atagagcctt gtgtgacata 12960
gagataatat ttgttgaagc aattaaattt gacttggaat taactctgcc atcattctat 13020
aaggaaggat tgaaaatcct tctcaccctg tgctgatata gtacctttct atacaaaaac 13080
gtccttctcc ctcttccctt ggattgcata aactatgtac atgccttcct caggggcact 13140
tttctaggac agtgtcagcc taaggatctt tgtttgggtg gcttttagaa actcaggaag 13200
acaggagcat catatgccta taggcagctg gcttccaggt cagtagtttt gctctgaccc 13260
taaaatcaga ctcccatccc aatgagtatc tacaggggag gaccgggcat tctaagcagt 13320
ttacgtgcca attcaatttc ttaacctatc tcaaagatgg agatcagtgt gagtccaatc 13380
catgtttaaa tggcggcagt tgcaaggatg acattaattc ctatgaatgt tggtgtccct 13440
ttggatttga aggaaagaac tgtgaattag gtaagtaact attttttgaa tactcatggt 13500
tcaaagtttc cctctgaaac aagttgaaac tggaaaatgc aatattggtg tatcataatt 13560
tttcttaaaa acataccttt gatgcttata aacatttcat ttgtagtgat agttttcagg 13620
atatgagttc aagaagctac atCaaaatca ataacaatat ttggtaacta atattaagta 13680
ataatgatgt tccgactcac cttattaatc tttaatacaa ccgtatgtgg ttagtactat 13740
cattatgcgc attctatgca gatgagaaaa ccgcaactcc aacggccaaa aattacagag 13800
gcataaatgg tttagacagg acttaaactt cagtgtgacc aaaacccatg cttctaacta 13860
ctatattcaa aactcagaga aaactgaacc cagaaaattg aaatcatgac taaattgcta 13920
tcaacatagg tgaaagtcaa ttaagtacag aactggagta tgactggcca attatcccat 13980
ataatgggaa ttctccacat gtacaaacca cttcatatgc taaacttgtt gacaacattc 14040
aaagctcatc cctgaatttg actatattga ttacatcgaa aatgttacat agcaacctta 14100
gaatccttgt gtaccttttc ttctcaaagc ctagattatt tctttttccg acgttttcag 14160
taattggagc agtaaacccc agtgtccctt acctacttgt ttattacctc cagatgcaat 14220
attactggta ctgtgattga gaaacgcaca cagtgctaat gaggaattca ctttctactc 14280
tgacactctg gaagaataga gatgcaatcc taaggaagaa tttaacacca caggctacat 14340
gactaaggat aaagagtaga aaattagcag gactctatta accgattaca gcaatccacc 14400
tgacagatga aaaaggcatg aaatgaaatg aaatgtagca gctacactcg tcctattgag 14460
aaaggaaaaa agtcacctgt aatgttgttc agaaatcctt tcagtactaa aaaattcatt 14520
gaccatcttc ctttagtctc gaaaatttct tagaaggtaa aaaaaggaaa aggtgacagg 14580
gcaaagacat ttgaaaagaa agaaaagagt gaatgaactt gcacacctgg cttggactcc 14640
ccattcccct taggtttcca ttgtggggga caaactaatg cctgggttac ctttcttgag 14700
agtgtgttaa ttgattcaat atctctgaag tgctactttc atctgaaagg ttataatttg 14760
aaattcagat ttacctggat aaatttgatc ttgctattat ggaaacctct agaaatcctt 14820
ggagtagtta ctcattatca gcttaaataa tatagccggt ggagctgagg gaatgagtaa 14880
ctcaattagt ctcagttaca actgaagggc acattgttgt aaactataat tgaaaacata 14940
aatatcttta cctagtttaa aaaataaaga tgctttaaaa ggaggaaggg aatagccctg 15000
aggaatgtaa atataagcac aaaacctcta caacagagtt tgctacgtgt gtggctgtgt 15060
tccacccagc aaaaatgcta agtctacaac tgacacaact tggatactct catgttccca 15120
cattttggtt tggtcaaggc tgtgcagttg tactgcaggc caccaccact cctggcctct 15180
acagtatatt gatctgaccc accaatctga tcaaggttta gaaaaatatt ttcagcccag 15240
ttagctcaca aacaaaatga gaattcccac aaattgctct ttatctcaga caacagagga 15300
aagctacagc aaaagcataa acaaattacc atttaagttt gttgcttcaa attaaagact 15360
aattgcaaca gctactagat agcacagttt atggggcatc tcggccccaa gtcttttgtc 15420
ttataaggtc ttgaaaaaaa gaaaggagat tttcatcaat aagagttttt tgttatcttt 15480
ttcccttgtt catcaggccc ttcactgcga gagagaggtg taaacgttca gggcatgcat 15540
tctagttaaa gaatattaat tggctattgg gtccctttgg ttagaataaa gacccctgta 15600
tgatgtccct agctgtacat caaacccaaa tatctctcag ataaatgaag gtctgtaaga 15660
atttggtcat tcctgtctct tctaaagagt aacagaggca ttttcccgca gtaaagtaga 15720
atggaaagaa aacaaaaatc acaagcctat aaacaccttc ttcaattttc ccagcatgtc 15780
acagacacta ctgtcttatt tactacgtat ttctgaggag taaaaaaagg aaatatgttg 15840
agtttagctg aagcacagca tattttgtgg taaacttgtt aaataaaaca tcttttgtcc 15900
aagctttggt tgtcacacaa gtggatatat caggaaatat aaaggcagaa taaactaaag 15960
cagaacatac taacatttgt agtaggcatg aagggaatta gaaagtgttt gtgttaacat 16020
ggagggaggg agaacagatg ctttgagatg ttcttcaaca gatattctag gcactgagac 16080
ccccttcggg accagagaga gcccatatcc accacagtac ctgacacata aatgctcagt 16140
aattgataaa tgagtcccat tctaactgtt ccttagccct gctctatgga actctcccct 16200
gaattccttg tgccattatt ttatttctgg aatcttcagc cttttagctg agggcaaaag 16260
attgctgatt aggaagcaat atttcccacc tcctgcgcaa aacaagccaa agatcaacag 16320
cagcagcaac atactgagcc ctaaagggca atgacaaatg tggagaatga t:acagaggt<: 16380
tggttacttc ttagccaatg acacagaatc acaattgaga aaacacagag tttattcatt 16440
cccattgtgc atgccctgga caaaccaagc tgcacctttc gtaacttatc acaatctcat 16500
attgacggaa cactttctac aggtaatgtt tgatttggct gaacacttta gcattgcttc 16560
gtagcaacaa aatgatagct agtaacagaa aaagatccag ggatattacc actgttagtg 16620
aggagaaagg ccttttaatt aattaattaa ttaattaata ggaccaagtig ccatcttttt 16680
ggatcatgcc cttagtggat tattggtagc aaaggttaaa gctcaagctg gttcctttgt 16740
ccccctggca acagttgatt tgcctccctt atctcctgaa gtaccgtaag gactaagagc 16800
caattattac atttggctat gctagcatat gtaaaataga gtttaaaagt ttagattcat 16860
cactcaaaaa ttcatattct ccaaaaccat acagtcactc tgttagcctg tgttccccca 16920
gaaaaaaagt cacaagctta ttattaacat gtgcaatcca ggggcaagag aaaggaactg 16980
aagatgaggc agaaaggaaa agaaagccaa taagaggatg agttatcaaa ctacccgttc 17040
cttaacagca actgattgct taacttcctg ggactgtctc caataagtca aattggcctc 17100
d99ttagtcc acctgagtgg gaagaagcgg tgaaagaatt tgtctgtcag tatctgtctc 17160
tcattggtta gaagttcgac ttatggggaa ttaactccct cacatttcct agttggatag 17220
cttgggtacc agaggcatat ggcatccatg ctcagcatga acagggaagc ttcaaggcaa 17280
aagacacata gtgcagctat gagccaaggc aattcaagga tacacccata ggaggctggt 17340
tgacatccac ccagagctaa tcaccaccat gctggaaaaa gacacaggcg aagctgagaa 17400
gaatgaaggt ggtgcatagg aggtatctaa tacagtcact cattttcaaa ctttccatgt 17460
tatgattgca ctgaccactg aggatttcta ttgaaagttt tactgttgtc aaacacgtac 17520
acaaggggaa aggtgtctta cattgtttat gttcctgtgc tgctctagaa acagaaatag 17580
gctcaagagc agagcctgtt tttcttaatt cagcaggtct aagctaacaa gtcctgaaac 17640
atggtacttc ctgttattgg tattgcatag gagaaacaaa gggaaagcac agtaattaga 17700
aaatacaaac aagatggcag gaataagcca aaaatatcag gaaacacaat tattgtgaat 17760
tgggattaaa ctaatctatt aataatgaca actttcagct tggagttaaa aatttaattg 17820
tatactgtta acgaaagtga tacctaaaat aaaattacac tgggaggcca aaatgaaggg 17880
atgtgaaaag aactatcagg taaaaactaa caaaaagaaa ctagcaaagc aatcttaata 17940
tcagacaaaa tagaatccaa gaggaaaatc atttcaaaag acaagagatt ttttttatta 18000
ataaggggaa ttgcatagga gagtaaagaa aatgtgggcc actggaatgc ttagcactaa 18060
tgacatattg gtctttggtc ttcagttacc ttacaggacc ctatttcatt ctcttatgtt 18120
tgatatgtaa ccacctcagc cagcttcaag ttgctttttg gccctaatgg acttcctagc 18180
actataattt cttttttttt aaatgtttta ttttaggttt aggggtacat gtgaaggttt 18240
gttacataga taaacatgtg tcacaggggt ttgttgtaca tattattaca tgacgcagat 18300
attcagctca gtaccaaata gtgatctttt ctgctcctct gcctcatccc accctcctcc 18360
ctcaagtaga ctccagtatc tgttgtttcc ttctttgtgt ttataagttc ttaacactta 18420
gctcccgctt acaagtgaga acctgcagta tttgattttt gttcctacgc tagtttccta 18480
aggatgatag cctccagctc cattcatatt cccacaaaag acataatctc cttcttttct 18540
atggctgcat aatattccat ggtatatatg aaccacattt tctttatcca gtctgtcatt 18600
gatgggcatt taggttgatt ccatgtctgc tattctaaca ctgtaatttc taaagacttc 18660
cagattctac ttttataggt aacctgttaa acagtctagc tctggaagcc aagcaatttc 18720
tagaataact aagcaataga aattacactt caatgcagaa aggcagtatc tacatgagat 18780
tatgaaattg cggttgcttt ttgtgttcac tgaaaaaaat aagtaaaact gtaactttca 18840
gaaaaaatga ttgtacatat agaaaaccca aagcatctaa acaattaaaa taaataagta 18900
tagaaagatt actggataca gagtcaacat acaaatatca attgtatgtc tatataccag 18960
caacgattca aaaatgattt ttataatagc attaaaaatt agacgcttag taataaatgt 19020
gagaaagatg tgcaagaact ctacataaaa aattatgaga cgttattgag aaaaattaag 19080
gaaaacctaa ataaatgaat gaataggcaa tgtttatcat taaaggatac aatatagtaa 19140
atatatcaaa tgtttactaa tggattcaat gcaataccaa agtgccagca ggcttttttg 19200
gtggtgggag gtcgggcagg attcataagc taattataaa atgcatatgg aaatgcaaag 19260
agccaaggat agccaagaca gttttgagga agaataaact tgtactactt acactaccag 19320
atgtcaagac ttattatcga gttacattta ttaagacagt gtggtactga cacaaggata 19380
gacaaataga tcagtgaaac acactagagt gctcagaagc acacctgtac atatataaag 19440
gcttgattta tgatagaggt gccagtgcag tagagaagga aattattggt gttttcaata 19500
aaaagtgata ggtcaattag atattcatat ggcatgaagt atgaaacaat aacaatttat 19560
attcataact tgcagaaagc aaaaatttct taaaatacaa aaagtgatca ccataaagga 19620
aaagattgat aaactggact atattaaaac taaggactcc tgttcagcaa aagacactac 19680
ttcgactgaa aagacaagtc acagagtgag acaagatatc tgcaatacag atacctaata 19740
actgaacccc atacagtgat ggtgggaatt taagttcgta caatcatttt agaaaattgc 19800
ttggcagtat ctactagatc tgaacatgtg atccagtaat tacactcata attataagcc 19860
agtaaaaagg catgtttatg tcaccaaaag atatatacaa gaatgttcat tacactatta 19920
tacataagag ccaaaaactg gaaacaaacc aaatatccat taacagtaga atgaataaat 19980
aaaagctgta atagtaatac agtggaatac tacacagcaa tgtaaatgaa ctactgctgt 20040
acaaaacaac atggtttaat ctcacagaca aaatgttaaa tgaaagacac agacgagtac 20100
atattgcgaa cttctgttta taattcaaga actggcaaga actgtttact gtgttagaag 20160
tccaggtaat ggtaacctat aaaaaggaaa aagggtggaa tgattgggag ggggcatctt 20220
ctggggtctt gataatgtgc tatgtattgg tcagtttagt gtttaaacag gctcatttac 20280
tttgtgaaaa cttacactaa aattgtgtgt attttttgaa tatatgttat acattaataa 20340
atagggtttt taaacctgta gttcataatt tagtgaaagt agaatatcca aacatttagt 20400
tttaaaccaa tcaattatag tgctaccatc atttttatgc attattgaga agtttatttt 20460
acctttcttt ccactcttat ttcaaggctc caaaatttct ctccccaacg tatattgggg 20520
gcaacatgaa tgcccccaat gtatatttga cccatacatg agtcagtagt tccatgtact 20580
ttttagaaat gcatgttaaa tgatgctgtt actgtctatt ttgcttcttt tagatgtaac 20640
atgtaacatt aagaatggca gatgcgagca gttttgtaaa aatagtgctg ataacaaggt 20700
ggtttgctcc tgcactigagg gatatcgact tgcagaaaac cagaagtcct gtgaaccagc 20760
aggtcataat ctgaataaga ttttttaaag aaaatctgta tctgaaactt cagcatttta 20820
acaaacctac ataattttaa ttcctacttg aatctgcttc cttttgaaat catagaaaat 20880
atcagtagct tgaattagac caattaattt tctagattgc atcatatttt aaatataact 20940
atgtaatcat ctacaacctg aattctttct gtgtccaatt tgtccaattt ttttctctaa 21000
catttatatc acaaagcaat taatttgtgt gatttctgca tatgtatttg taattcatca 21060
agtcaaatca atgtagtaat actatatcat aaaatataca caaataattg agtgataggc 21120
ttctagtata aggacggtaa gtttigaagca tgattctatc tgggctggct agtttactct 21180
gagaaagtta ttttttattg ttgggtctta agctgagttt acacacttgg tgtcagaatg 21240
attccggcaa tgaactgttt tatgttctgc taggctgatc agcacaatct atatggctgt 21300
gaacaaaaca atgtttccca gtcataccaa ccatgccacc attttaacag ctgattagtg 21360
tattcagaac atctccactc catgttcgta tggctgttat ctaaagatga aagcagtaga 21420
cacttttatt tfettgaaaaa tttaggctct gcagggtcaa ttatatttga taaatgaggg 21480
gcttttttga agcaaactag atataatttc ttttgcattt ctaaagcctg atatcttatt 21540
aattggtaca ttaaattgtg caccatttct ctgtaactgt ttcagtacct gtctcagcac 21600
tataccaggc agaagaaata aagaaagaac cagtgccaga tcagcttggt caggagaccc 21660
taatcctgcg gcactagagg aattaaagac acacacacag aaatatagag tatggagtgg 21720
gaaatcaggg gtctcacagc cttcagagct gagagccccg aacagagatt tacccacata 21780
tttattgaca gcaagccagt cataagattt actgaaagta ttccttatgg gaaataaagg 21840
gatgagtctg gctagttatc tgcagcagga acatgtcctt aaggcacaaa tcacttatgc 21900
aattgtctgt ggtttaagaa cacctttaag cagttttccg ccctgggtgg gccaggtgtt 21960
ccttgccctc attctggtaa acccacaacc ttccagtgtg gatatcaagg ccatcacgag 22020
catatcacag tgctgcagag attttgttta tggccagttt tggggccagt ttatggccag 22080
atttggaggc ctgttcccaa caaaccagaa gctaggaata tatatcctgc aaataaaatg 22140
aagaatctct aaggcttcag ggcctgccca cttgttcttc tgcctggttc ttcacataca 22200
ctgtctcaaa gctagtctac cttgagagga gcatgaatat gtgtgtgggt gtgtgtctgt 22260
gtattttaac cttaaaaacc taacttccag tatagacaga tggcatacta gctaaaccct 22320
tacaagttct tctatgctat aaaagagaaa cagaattgag aaccacctcc aactattaag 22380
tgttatattt gaatatagcc ttagctttag cagaataagt aggccaaact taaaataagc 22440
ttttctgcct tttcaatgat aaaggtccct tttctgtagc cattgttgat tgtgtacact 22500
tatacataag tattttgaac taatttcctg ttttctcaac cacttgctgt cttcatgata 22560
ctttgtcgca gctggttgct atagaaatgt ctgttacaag gaatgtggct tgaaggaaag 22620 
tgataaatga aaatgaaatg tgaagtgact ttgtttgact acaaattccc attctggtag 22680
tccccagtgt atcaatacat tatttttctt tagaaaataa accaacccaa ggaaaaatgg 22740
tgggcaggtc ctggtgaata tggctgtgat aattatatta gcaatctctt tggctaatat 22800
ttgaagccca aataattgaa tcacaatgat ctctccccag aaaatatata aaatgcacct 22860 
tggaatctag aaggcctttt agtctgcaaa agaaaccttc ttaatcataa gcagcagaag 22920 
tcccatttac caaattggaa agttaaagtt acaaagcatc aatcatcaga cttccattca 22980 
gggatggcaa ttgggagtaa gactttttag taaagaaact aaacacaaag tcattagact 23040 

ctgtaaaagt cttaccaaat ttgattctgg aacacctatt ctatttccgt aaagatgatg 23100
aattcggagc caaatgttct tttcatgaag gatttgaaaa ctgtccatga aaataacgca 23160 
atcaaccttt tagcttgaga ctctattcac tgattagatt tttttaaata ctgatgggcc 23220 
tgcttctcag aagtgacaag gatgggcctc aatctcaatt tttgtaatac atgttccatt 23280 
tgccaatgag aaatatcagg ttactaattt ttcttctatt tttctagtgc catttccatg 23340 

tggaagagtt tctgtttcac aaacttctaa gctcacccgt gctgaggctg tttttcctga 23400
tgtggactat gtaaattcta ctgaagctga aaccattttg gataacatca ctcaaagcac 23460 
ccaatcattt aatgacttca ctcgggttgt tggtggagaa gatgccaaac caggtcaatt 23520 
cccttggcag gtactttata ctgatggtgt gtcaaaactg gagctcagct ggcaagacac 23580 
aggccaggtg ggagactgag gctattttac tagacagacc tattgggatg tgagaagtat 23640 

ttaggcaagt ttcagcacta accaatgtga gaaggcctcc agagatgagc agttggtgaa 23700
agagaggctc aaaaccagct accatacagg tcaagaagaa tttggcatta aggaaacagc 23760
atagcaggat tccagacagg caactggtca acaacatgaa ggtctggaag aaaggtcgca 23820
gtactcaggt tcagggcact acttcagctt cagcccttgc aaaaactggt gagagttgga 23880 
aagtctttag gctaagaaaa attggattat ttaaaagggg taaagaaagg gactcaagga 23940 

ggaaggatta aggcaagaac taggttccaa gaaacagggc atgagagaga gtcttgatct 24000
accactatag ttctcgtggt agcatcagaa tcacctggga acgtagaaat gcaaattctc 24060
ctgctctaca ctagacctac caaatcagaa tatctagggg gtggggccca gcagtctgtg 24120 
cgcaaacaag cactgcaggt gattttgatg cacattatag tttgaaaact aggccaggtg 24180 
cagtggctca tgccaataat cccagcactt tgggagactg agacgggagg attgcttaaa 24240 

cccaggagtt tgagaccagc ctgggcaaca cggcgaaacc ccacctctaa ttaaaaaaaa 24300
tacaaaaatt agctaggtgt gatggctccc acctgtgctc ccagctattc aggaggctga 24360 
ggtgggagaa tcacctgagc ctggaaagtc gaggctgcag tgaattgtga tcacaccact 24420 
gcacttcagc ctgagtgaca gagtaagacc ctatctcaaa aaacagaaaa agaaaaacac 24480 
tggcccaaag gaaatgaact tgttacagaa gccggggttc aaaacaccaa ataatgcact 24540 

tgtacctagt ccttcccggg tgctctgcag acatttctcc aagcgtagtc tgcaaacaac 24600
ctacatatgt agaattacct atgcacattt ttcatttaac aaccaagagc tacatttgta 24660 
gcaaaatctg ggttgtaact tagcctacag ctgaagccta agagattccg tctgtgagaa 24720 
gaaataaccc acctctttgg cccccctccc caggcaggaa gccaggatgg tccttatata 24780 
aagttgtgct gtccaatagg taaccacCag ccacatatgg ctatttaaat ttaaattaac 24840

tacaattaag agaaattaaa aattcaattc ctcaattgca cctgccaaat tttaagcaca 24900
taacaaccac atgtggctag taactactgt attggagagt gcaagcggag atagaacact 24960 
ctattactgc agaaatttct attggatagc acttataata gtttagtgta acttaaaact 25020 
ccctagttgc cacagtcatg atttagtagt aatttcatgg atttctctac tgaggttaga 25080 
atctctgcca ttagagactg ataaatttaa agtttgcaat tatcaaactg gtgacaattt 25140 

aagccagaat caggtaatgt cctcagtttt aacagcattg gaattttctg ggactagctg 25200
tgtatctatc caggattctt gagaatgcct
ttacacatat acctggggat ggggaggtag 
tatcaaagag tggcagaaca tcacaatcaa 
aaaaagacaa tatttgctgg acctgatctt 
tttttacctc ctctgtttct acttaattgc
ataacccctt ccttcactca tccatatctt 
cttctttatg aacctttcct ggaacaatcc 
caatgtattt aatatccatg tatctattct 
attttcattc attatgtgtc caacttccat 

10 tttatgacaa ttatcttcct tgggtttgtg 
ccaagaaccc aggagaatat attagctaag 
ttcaggagat caaaaccatc ctggctaaca 
aaaaattagc ccggcgtggt ggcaggcgcc 
aggagaatgg cgtgaaccgg ggaggcggag 

15 ctccagcctg ggcgacagag cgagactccg 
agattttaaa aaaaaaaaaa aaaaggatgt 
tcattccaga aatgaaatgg caaatacatt 
taaaggctca atttagtcac atcatttccg 
ttgccaatgc attaacaatg cagatgtttc 

20 accttatttt cttttcaagc agaaaagact 
ggagaagata taaatgatac acattatttc 
tagttattgt acctgttgtc tttttgctgt 
aagatcactc atacatgcat gtgcacacat 
atccacatga actaagatta ctgatgtgta 

25 aggcaagaag ctgagctact ttccagaata 
tgtttcctcc acaccacctc catccagttc 
tgagataaat tgagtgtata aaagtcattt 
accagaatct ctcctcattt gtggatgggc 
ggaggaaata ctagatttga ttgcagatca 

30 tcaagagaaa gcaagcaaca gactggggct 
tgaaatatga aatactgttt agcagtgtca 
cttcataaga acctttctct tcagagttgg 
ggaaataaag tgatcacttg gtgaagaaat 
tcatctggag taatgaacag attgaacaaa 

35 tgtaggtgag ctgtttgcaa gagccacaag 
taagggtgaa agttgcaagc aggcaagacg 
aaccaacaac cactgggttg gttacgcagg 
aacaaatgtt tgtcggaatt gttgacttaa 
gctagatagc cccattcagg gagagggcat 

40 taagggactg ctgggatcct gtccagcttt 
cgtatcccgt ctgcggtcac ggtcatttct 
gccaggctgg agccaagggc caacgcagcc 
agccccctat ggttccggaa cgcgctgccg 
aaaggcactg gcgggccggg ccagctCctia 

45 ggaacggcgt gaggccaatg gaaggaggta 




PCTAJSOO/15728 



gccatttttc aacataatgg atgtaaggta 25260 
gtataattgc acaagcattg tggagaatgg 25320 
ggttttccct ttGttttacc tttgcttttt 25380 
ataactcata aatgggacac tgtatgttcc 2544 0 
accctatgag gactgcttcc cttacctacc 25500 
tactcttctt cacaactctg taatattgac 25560 
ctcttaatgc aagcactgtt attatgcctt 25620 
ctctaatttt gtcattttgt gttctcatgt 25680 
ggataacatg gttacaacaa aagatcctac 25740 
ggacatagaa cagtgcacag agtaggggat 25800 
aagataactt ccgtttttaa aagtccaaga 25860 
tagtgaaacc ccgtctcttc caaaaataca 25920 
tatagtccca gctacacggg aggctgaggc 25980 
ctggcagtga gccgagatcc cgccactgca 26040 
tctcaaaaaa aaaaaaaaaa aaaaagtcca 26100 
ctgctttgtg agtttagcat tgtctccttg 26160 
taaatcagaa ctaaaaaggg gaacagggta 26220 
tttctcaccc acccccttta aaccagatgt 26280 
ctgaaagaaa gtttagtaac tcaagcagac 26340 
atgagatggt ggttgtggtt gttccgggag 26400 
aaatcatttc atgacctcac tgcacactta 26460 
caagcctagc taagatcatt tggaatgttc 26520 
acacatgcac atatgttcac tccctatttc 26580 
cagattcaaa gcacttttat tcttttccaa 26640 
gttgtgaaag accctgtcat acttctgcat 26700 
cttatgaatg gttactggtt ttcaaaaata 26760 
ttagacaaaa tgaaacagga aatgaaagaa 2682 0 
cagctccacc atgtcatggt taatctgcag 26880 
gactgcagca aacctgctgt gactaaggca 26940 
tcagtggtga aaacattata tatctagctt 27000 
cctagaaaag agtgtttcaa aatgctgatg 27060 
tttcttttat ctttcaaatt agccagggtg 27120 
ctcacaaaga agaacataga gagttcactt 27180 
ctagaaatgg ttagtctgtt aaagaaaagg 27240 
ggaaagggga agacaacttc tttgtggact 27300 
attctgacct ccattaagaa agcccaaacc 27360 
ttgggcagca ttgggagcaa atgttgattg 27420 
agagctgttc tgtcactggg gacagcagcg 27480 
ttgttcacct ggccagagat cagagcaggc 27540 
gagaccctac agagccatgt tcacctagca 27600 
taccttattc cagggctttc acctcagctt 27660 
gcgccttgtt cgcgatggta gcttcccagg 27720 
gccccatcct gtttgctacc tcctaaagcc 27780 
aagtcgcgca aggttagaag gttccggaca 27840 
cttcagtttc cctccagatg cccagcgatg 27900 
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ggctcagagc tccttgagaa ctcgggaaag 
agtagaaaga ggaagctaga gggttaaatg 
tagagttagt atatgtctag aggtgtagta 
cacgtaggga agaaatgaaa acctttgaat 
5 gtattactag atagctttca tcaacagctc 
cattttggct ttctaaagct ttaatttggt 
gacaattcta agcatgcgca aactggcccc 
tttggccctt tatgtgaagt acctggtttt 
tcggtgtgtg gcgtatttat tctacattta 

10 aaaaaagaat acaatggaag ccaagtgatt 
gcatatgcat ttaccgatag ttaaccgtat 
ctatttctta ttctattttt gacctaaaga 
agtctccatc cacttccctc agctttggcc 
gctcttgccc tgtacagcta gctacagaga 

15 ctcagtggca gataacatgc aaagttatta 
ggggccagga tgttacacta aggtcttaag 
ctctattatt cctatctcag atgataaaaa 
tgtaatcgca cctggctcta caaagctagt 
agattatttg atgaaatgca tttcaatgac 

20 cccatctgtt cccagtcaaa tgacctagag 
atagcaagct gtacatcaca aacaaatgaa 
tctttattgt acaacttggg tgggtaagta 
gtaacggtta cgttggagtt aaaggttagg 
tgggctggca ttgtcaatga agagcataaa 

25 atgtgtgttg ttggtttttg tgtatgtgtg 
agactatgac taacaaaaat gtttgacaga 
tgccttccca ttttccccat tattccccaa 
gagcagacat ctcatacccc aaatagctaa 
aacattccaa aaccaagtag tttgtaatat 

30 cttggcaagg atgtttggtc aggggttggc 
acaaccatat ttcttagcca tccaccagaa 
tgtcaggagt cagactagct acatcataat 
atcctggcct aactagccta ctgagctgag 
cagaggagaa ggaccgtgat atcattgcat 

35 atatgttcag ttgtaactat caataccagt 
gctgacaatc aattggagtt acagccagac 
gggctttgca ctgagacagg tcgtgcatct 
aactgaagtc tgatgagaca gccagagcat 
cccatttcac ttctcaccac cttccggggt 

40 tggtgcctta gccagccctg gctctccctc 
catggtgccc agacatgctc tccctcctca 
cccccagcgg ggacatgggc ataggagcag 
cctatgctat cttggaaggg ggcttggcca 
agcacatttg gagggtgaat ggtgggggca 

45 gaacacatcc tggcatgaaa gttgcagtcc 
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gaagcagggt ctctgaagaa atacttcagg 27960 
cactacacag gaacagaaat gagtttttct 28020 
aactaaaaca agtcttgaat tgcataccgc 28080 
attagtgaaa aaagggaaac tgcaacgcct 28140 
aaaaccgaca gatttaaaga agcaacaccg 28200 
ttggatccca tgcccatgac cctgccagct 28260 
aaaaattcct cccacatttc cgaagaacta 28320 
tccattttct gttttaccat aggcctcagt 28380 
acaatttgaa gatcattcta ttagattaaa 28440 
aagctttcct tatgcttata ttaagttgta 28500 
taacctacag aaaatgtcca gggaaatggt 28560 
aaatctttaa aatgtcttag cattttcccc 28620 
tgaagctatc tttaaaggta ccctgtacaa 28680 
ttcaatcctt tctgttcgat taggacacat 2 8740 
tatgtatgaa ccagaacttg tttttcctta 28800 
actatagtaa tatcttcact tgaaaaagcc 28860 
ttcaattaag agaaataaga acgtgacatg 28920 
ctggacagac atttaaacaa ttatcctcta 28980 
tagttaacca ttaaaaacca aagtgagcat 29040 
caaaggacta ggcaaaccac atctgtgggc 29100 
tttgctttgt atatgagtga gagcaaacac 29160 
gggagaataa tggttttact gaaatcgcag 29220 
aagaaaacca aagggtaaga gctgttgttc 2 92 80 
ttcagatgtg aatgtatatt ttgtagaagc 29340 
agtctgaaag agggaaaaca ggctcccatt 29400 
ttataactca gatgtcttac tcagagcata 29460 
catgatgtct ttaagaactt gtccttgacc 2 9520 
tattttgata gctatgatcc tgaacggcca 29580 
ctttaatgca aatatatttt aggccttttc 29640 
aaaaataatg ctcttcagac ttaaaagaac 2 9700 
agtagtagaa cgctccagga agcaagtctt 2 9760 
ctctctgccc aggggctgtg gatgtcatcc 29820 
agatgtccaa tttcccccca atacactaac 29880 
gtgaattctt • aattccaatt gcttaaacaa 29940 
atataacagt gttggccaag ttttattgat 30000 
acatggtctt atgaccggcg tacttacgca 30060 
gaggtttact gctttgcatt tttgttttgt 30120 
gtgctaccta gggacttgaa tccctgcagc 30180 
ggtttctcga cctcccactc ccctaccacc 30240 
caaacacctg cccaatgagc actgccaccc 30300 
tccctaccta gctaccattg ccactcccct 30360 
ggagagttaa ggttggtcag gtgcacgtgc 30420 
tgtggcatct ctggaccaag aatgcgccac 30480 
caccccttgt ccacctctat ttcaggcatg 30540 
cttgggaatc acctctccac cttgattgcc 30600 
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acagtaggcc agtgacaagg gaagattgac acatcatccc tgctggggcc cagtgtcctg 30660 
tggctggcag gcaggggatc ctaaggacat gtgggtctta aattgtaggg tgcacttcct 30720 
gggcaccttt gagggtctgc actgccccag caaatatccc catgctagaa ggagcaaaat 30780
attaaatggc aaattttaaa aatgtaacaa gatgggttgc aaaagagact acagaggaaa 30840 
5 gcaaaagttt tgtattttag tatcttccat ggcacttttc ttcctagctt ttgaacaagg 30900 
ggccccacat ttttatttct cactgagccc cacaaagtat gtagccattc ctgcccggag 30960 
tgaggacttt taaaacataa agattatcaa gtcttggaaa ttctgattca gtagatatat 31020 
aacaggtctc aaacttaatt atgtaaagaa tattctggag agcttccttt tacccagtcc 31080
cacccaccaa atattctgat aaattaagct tcgattagcc cccagatctg cattttataa 31140 

10 ggatccccag atgattctac tgcaattggt ccacagacca tgcctggacc gaatttgggt 31200 
gcttaggagc acaaattctg gagccgggca gacttgagtt tgcttcctag ctttaccaac 31260 
tgatctcagg ggagttaatg tttacctcta aactttagct catgcatcta taaataaata 31320 
tattaatatc atgtcataag gatattatgt tgtattaaat gtctttaaaa caccacaatg 31380 
attagcccaa agtaaacact caataaatgt tcaaaaattt aggaaaattg ttaagactgg 31440 

15 gttgtatgca cactggtgtt tattatatta tgtagttttt tctgtatttt tacaacattt 31500 
cagaattaaa agcaacagct agaaaaagag ggaaatggcc gggtgcagtc gtcacgcctg 31560 
taatcccagc actttgggag gccaaggcgg gcggatcacg aggtcgagag atcgagacca 31620 
tcctggccaa catggtgaaa ccccatctct actaaaaata caaaaattaa ctgggcatgg 31680 
tggcatgcgc ctgtagtccc aggagaattg cttgaacctg ggaggcggag gttgcagtga 31740 

20 gccaagatct caccactgct ctccagcctg gtgacagggc aagactccgt caaaaaaaaa 31800 
aaagagaggg agagccagag tatgaaaaag gaagtcagag ccctttaatg agtcagcttt 31860 
gtaggtctcc aggtaggagg ctagtgcttc agtgtctagg acatagtagg tgttcagtaa 31920 
attaaattca ggacaaaaag aacatgcccc aaggaccatc tgatatccac ttaaagtgat 31980 
ggactacctc gtttcccttg tttatgaatg ggttcatgcc taagactgtg tgcactttaa 32040 

25 tacaagggca gtcgttcaga actagtcagg tcctgaaaag gatttaccaa atgttgagtg 32100 
tgccctctag tgttcacact tcccagcttt cttcctataa aggtggatca aggcacttgc 32160 
ttacaactgg aactgaaatc ctccaagtcg atctagacat tgagatggag aaaatattca 32220 
ttgtcgactg taattatgca acgaatatcc agttgagata atggacttgc ctcttatcta 32280 
ataataccca ggctcaatgc gtcactgctt tgtccacttt gcccaaaatt caagcacagc 32340 

taagttgata ttttaggaca aaggcagctt actatccagc cagaggggag tagaatatgg 32400
ttaagagaga gtggaaagaa tgaatgagcc ctgctattcc tcactgcctg gatggctata 32460 
agcacagccc ttatggaggc cttaggtctt gcttcacaat attccagttt gaaaagggtt 32520 
tgaaaagacc tcctagaaaa atcagtagtt tttctctttt gagtaacatg tagcaaaaaa 32580 
aatttcatca tgtaggtaca gggaacaccc tagtaactat taatctcaag gagtcaagcc 32640 

agtgtgtttc ctaatgtatc tgctgtatcc ccatgaagca aattttgcca tcagagaaac 32700
tgactcatgg ggaaaaaatc caaggacctc aaatcaccaa aagaagccat tcctcagatt 32760 
tgcctaagct taagcttccc tgtctctcat tgtgtgttgc tttcaatgca gttacataaa 32820
tggctttttt gtttatgcac caaaaacact aattcatctg caaagctcac atttccagaa 32880 
acattccatt tctgccagca cctagaagcc aatattttgc ctattcctgt aaccagcaca 32940

40 catatttatt tttttctaga tcaaatgtat tatgcagtaa gagtcttaat tttgttttca 33000 
caggttgttt tgaatggtaa agttgatgca ttctgtggag gctctatcgt taatgaaaaa 33060 
tggattgtaa ctgctgccca ctgtgttgaa actggtgtta aaattacagt tgtcgcaggt 33120 
aaatacacag aaagaataat aatctgcagc accactagct ctttaatatg attggtacac 33180 
catattttac taaggtctaa taaaattgtt gttgaataaa ttgggctaaa ggcagaaggg 33240 

tcataatttc agaacccacg tcgcaccgtc ctccaagcat ccatagttct tttgatatac 33300
ccctattatc actcatttca gtgaggtaca attagttctt gatgtagcca tttccatacc 33360
agaaggcctt cccaaaaatc agtgtcatgt caccgatcct tttatctctg gtgcttggca 33420
caacctgtag caggtcctca gaaaacaaac atttgaatta atggccaaat gagtttgtgc 33480
tcaaaaaagg ggtgaggata cttgaaattt ggaaaatcta ggataattca tgactagtgg 33540
attcattatc accaatgaaa ggcttataac agcatgagtg aacagaacca tctctatgat 33600
agtcctgaat ggctttttgg tctgaaaaat atgcattggc tctcattaca tttaaccaaa 33660
attatcacaa tataagaatg agatctttaa cattgccaat taggtcagtg gtcccaagta 33720
gtcacttaga aaatctgtgt atgtgaaata ctgtttgtga cttaaaatga aatttatttt 33780
taataggtga acataatatt gaggagacag aacatacaga gcaaaagcga aatgtgattc 33840
gaattattcc tcaccacaac tacaatgcag ctattaataa gtacaaccat gacattgccc 33900
ttctggaact ggacgaaccc ttagtgctaa acagctacgt tacacctatt tgcattgctg 33960
acaaggaata cacgaacatc ttcctcaaat ttggatctgg ctatgtaagt ggctggggaa 34020
gagtcttcca caaagggaga tcagctttag ttcttcagta ccttagagtt ccacttgttg 34080
accgagccac atgtcttcga tctacaaagt tcaccatcta taacaacatg ttctgtgctg 34140
gcttccatga aggaggtaga gattcatgtc aaggagatag tgggggaccc catgttactg 34200
aagtggaagg gaccagtttc ttaactggaa ttattagctg gggtgaagag tgtgcaatga 34260
aaggcaaata tggaatatat accaaggtat cccggtatgt caactggatt aaggaaaaaa 34320
caaagctcac ttaatgaaag atggatttcc aaggttaatt cattggaatt gaaaattaac 34380
agggcctctc actaactaat cactttccca tcttttgtta gatttgaata tatacattct 34440
atgatcattg ctttttctct ttacagggga gaatttcata ttttacctga gcaaattgat 34500
tagaaaatgg aaccactaga ggaatataat gtgttaggaa attacagtca tttctaaggg 34560
cccagccctt gacaaaattg tgaagttaaa ttctccactc tgtccatcag atactatggt 34620
tctccactat ggcaactaac tcactcaatt ttccctcctt agcagcattc catcttcccg 34680
atcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 34740
tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 34800
gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 34860
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 34920
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 34980
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 35040
agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 35100
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 35160
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 35220
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 35280
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 35340
ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 35400
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 35460
gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 35520
taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 35580
ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 35640
tctatcttga atcttctaga gagttgctga ccaactgacg tatgtttccc tttgtgaatt 35700
aacaaactgg tgttctggtt cataccttgg ctttttgtgg attccattga tgtgaatcag 35760
tcaccctgta tttgatgatg catgggacta ctgacaaaat cactctgacc ctgccaagct 35820
gctgccttct cctgccccaa cctcaccccc agccaggcct cactcttgct agttccttta 35880
gttcttttag tcaatatatt tttgtcttcg catataagta taaataaaca tatttttaaa 35940
tttcttggct gggcccagtg gctcacgcct ataatcccag cacttctgga ggccaaggtg 36000
ggcggatcac ctgaggttag gagtttcagg ccagcctggc caacatggtg aaaccctgtc 36060
tctactaaaa atagaacaat tagctgggct tggtaatgtg cacctataat cccagctact 36120 

ggggaggctg aggcaggaga atcacttgag cctggggagc agggggtgcg ggaggttgca 36180 
gtgagacaag atcgcaccag tgcactcccc atcctgggtg acagagtgag actctgtctc 36240 

5 aaagaaaata aataaataaa tacatttctt gaggcgtttc ttgttaaatc attcatggag 36300 

aggcatccca aacaccacat tcaacaaaac actctgaaaa atgttttcaa atgcaatata 36360 

acacagcaga gatttgatgc tctgttatcc agttttcata taaggctgtg tgagctgtgt 36420 

cccagagagg acagtggtct gaatccacct gagacagaat tgggtctaac taactgtgag 36480 

tatggccttc aataagtcac tctccatttg ggaatttgat ttctccactt gtataatgag 36540 

10 agtatttgac aggatgctct cccaaatccc ttgcaatttt gttagtctgt gatttcatgt 36600 

ttttattttt attccttcat ccaacaaata gtcaaggagt aattgctgtg tgccaaatac 36660 

caacagtatt cattaaattg taattcagat tttatatata tataaataat gtataatgtg 36720 

tataaattgc tttgtgagtg cctactacac tgctagacag tagttgctca ataacttgtt 36780 

agctgaatca gaatccatgt ttatcccaga gtagcaatta gtcttgcatc gagtatcgtg 36840 

15 aaagaaggcc acacttaaat aagaataatg cctggggttt aggttttatg aaaaaatgaa 36900 

aggaaattag ttctgctttt gttgactaaa ggaagggaag agagaagaga cactataatt 36960

gtctgcctca gatttaagga ggaggctaat tcatgcatta aacacgttac ttcaaatttg 37020 

aatgaccaaa ggtctgtagc ctcagcactt caaaattggt aaaagtaaga cactctggcc 37080 

ttgtttccat agagaccacc ccttacaaag gcaccaatgg gaaactggcc tcaggactcc 37140 

20 tgttattggt cttctctgtg gcagagaaag gagctcttgg acccataaat ctctgagcca 37200 

cagttctttt tgccatgggc tcaaaaatga ttgaattcat catgagccac ctgtggcata 37260 

ttgccacact aaacatgtgg ggcctttaag ctcactaaga gccaatgtct tcagagccag 37320 

ccctggcttg attctaccta gggcatttgc agttgccata taagaatcat tagtgctttc 37380 

aaaattactg tagatacttt gcctaaatag actaaaacat gctgccgtca tattggaagt 37440 

25 gacagattaa aatagaactc ttgccaagtg aaggaaagtg tgctaatata atgcagtcat 37500 

tttaacttgc tgtttaagtg tgattgtttt tagttctttt gaatattatt tgttttatac 37560 

tgacaggaac gaagtactgt ccaattttct ctgccaagga aaaaagaaaa ggtgttcttc 37620 

cttacttacc tgaaccaaaa cagaccagtt tacaaaattg cctaattata attgctaaac 37680 

aagttccgaa tgcttacagt ctaatccaag aatgtcagag ctgcaagggc ccttaaacac 37740 

30 catccaatcc actccactca tttagcagat gaagagattg agggcaacat aaggccaggc 37800 

ccaagataac acaatgacag ccaggactag agctcaagtc tcccaccctg cactttgaaa 37860 

gaataatgct ttcaactgga gtacattaac tctactgtct atatttttag ggcagctggg 37920 

gcattctgca ttggtggcaa tcctctcaac aaccctggga ctgaaaactg cctggaattc 37980 

ttactaacaa ttctctaatt gaccaaaagg tgacgaaatc aaggagacca ataaggtagc 38040 

35 cttggaaagc aagagtggc 38059 

<210> 5 
<211> 29 
<212> PRT 
<213> Homo sapiens
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<400> 5 

Met Gln Arg Val Asn Met Ile Met Ala Glu Ser Pro Gly Leu Ile Thr
1 5 10 15

Ile Cys Leu Leu Gly Tyr Leu Leu Ser Ala Glu Cys Thr
20 25

<210> 6 
<211> 55 
<212> PRT

<213> Homo sapiens 

<400> 6 

Val Phe Leu Asp His Glu Asn Ala Asn Lys Ile Leu Asn Arg Pro Lys
1 5 10 15

Arg Tyr Asn Ser Gly Lys Leu Glu Glu Phe Val Gln Gly Asn Leu Glu
20 25 30

Arg Glu Cys Met Glu Glu Lys Cys Ser Phe Glu Glu Ala Arg Glu Val
35 40 45

Phe Glu Asn Thr Glu Arg Thr
50 55

<210> 7 

<211> 8 

<212> PRT 

<213> Homo sapiens 

<400> 7 

Thr Glu Phe Trp Lys Gln Tyr Val
1 5 



<210> 8

<211> 38 
<212> PRT 
<213> Homo sapiens 
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<400> 8 

Asp Gly Asp Gln Cys Glu Ser Asn Pro Cys Leu Asn Gly Gly Ser Cys
1 5 10 15

Lys Asp Asp Ile Asn Ser Tyr Glu Cys Trp Cys Pro Phe Gly Phe Glu
20 25 30 

Gly Lys Asn Cys Glu Leu 
35 

<210> 9 
<211> 43 
<212> PRT 

<213> Homo sapiens 
<400> 9 

Asp Val Thr Cys Asn Ile Lys Asn Gly Arg Cys Glu Gln Phe Cys Lys
1 5 10 15

Asn Ser Ala Asp Asn Lys Val Val Cys Ser Cys Thr Glu Gly Tyr Arg
20 25 30

Leu Ala Glu Asn Gln Lys Ser Cys Glu Pro Ala
35 40

<210> 10 
<211> 68 
<212> PRT 

<213> Homo sapiens 


<400> 10 

Val Pro Phe Pro Cys Gly Arg Val Ser Val Ser Gln Thr Ser Lys Leu
1 5 10 15

Thr Arg Ala Glu Ala Val Phe Pro Asp Val Asp Tyr Val Asn Ser Thr
20 25 30

Glu Ala Glu Thr Ile Leu Asp Asn Ile Thr Gln Ser Thr Gln Ser Phe
35 40 45

Asn Asp Phe Thr Arg Val Val Gly Gly Glu Asp Ala Lys Pro Gly Gln
50 55 60

Phe Pro Trp Gln
65
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<210> 11 
<211> 38 
<212> PRT 

<213> Homo sapiens 


<400> 11 

Val Val Leu Asn Gly Lys Val Asp Ala Phe Cys Gly Gly Ser Ile Val
1 5 10 15

Asn Glu Lys Trp Ile Val Thr Ala Ala His Cys Val Glu Thr Gly Val
20 25 30

Lys Ile Thr Val Val Ala
35

<210> 12 
<211> 182 
<212> PRT 

<213> Homo sapiens 


<400> 12 

Gly Glu His Asn Ile Glu Glu Thr Glu His Thr Glu Gln Lys Arg Asn
1 5 10 15

Val Ile Arg Ile Ile Pro His His Asn Tyr Asn Ala Ala Ile Asn Lys
20 25 30

Tyr Asn His Asp Ile Ala Leu Leu Glu Leu Asp Glu Pro Leu Val Leu
35 40 45

Asn Ser Tyr Val Thr Pro Ile Cys Ile Ala Asp Lys Glu Tyr Thr Asn
50 55 60

Ile Phe Leu Lys Phe Gly Ser Gly Tyr Val Ser Gly Trp Gly Arg Val
65 70 75 80

Phe His Lys Gly Arg Ser Ala Leu Val Leu Gln Tyr Leu Arg Val Pro
85 90 95

Leu Val Asp Arg Ala Thr Cys Leu Arg Ser Thr Lys Phe Thr Ile Tyr
100 105 110

Asn Asn Met Phe Cys Ala Gly Phe His Glu Gly Gly Arg Asp Ser Cys
115 120 125

Gln Gly Asp Ser Gly Gly Pro His Val Thr Glu Val Glu Gly Thr Ser
130 135 140

Phe Leu Thr Gly Ile Ile Ser Trp Gly Glu Glu Cys Ala Met Lys Gly
145 150 155 160

Lys Tyr Gly Ile Tyr Thr Lys Val Ser Arg Tyr Val Asn Trp Ile Lys
165 170 175

Glu Lys Thr Lys Leu Thr
180

<210> 13 
<211> 2410 
<212> DNA

<213> Mus musculus 



<400> 13 

gaattccgtg gatgtgcttt aaaaccacac [illegible] 60
gcgctacaac ttcatcagaa acgaagtccc [illegible] 120
gtctaatgac taagtcttcc aaccacaact [illegible] 180
tctggggacc cggggctcag gccgcctcgc [illegible] 240
catcccccgg gcatgctgag catccccccg cggctccggc acagacgccc ggacctcagg 300
tctctgcctc cgcgcggggg cccggccctg tggccggagg gagcggccgg atggagcgga 360
ggatgaaagg cggatacttg gaccagcgag tgccctacac cttctgcagc aaatctcccg 420
gaaatgggag cttgggcgaa gcgctgatgg tcccgcaggg aaagctcatg gacccgggct 480
ccctgccgcc ttccgactca gaagatctct tccaggacct cagtcacttc caagagacgt 540
ggctcgcaga agctcaggta ccggacagtg atgagcagtt tgttcctgat ttccattcag 600
aaaacttagc tttccatagc cccaccacca ggatcaagaa ggaaccccag agtccccgca 660
cagaccccgc cctgtcctgc agcaggaagc caccactccc ctaccaccat ggagagcagt 720
gcctttactc cagacaaatc gccatcaagt cccccgctcc cggtgcccct ggacagtcgc 780
ccctgcagcc cttttccagg gcagaacagc agcagagcct cctgagagcc tccagctctt 840
cccagtccca ccctggccac gggtaccttg gtgagcacag ctccgtcttc cagcagcccg 900
tggacatgtg ccactccttc acatctcctc agggaggggg ccgggaacct ctcccagccc 960
cctatcaaca ccaactgtcg gagccctgcc caccctaccc ccagcagaac ttcaagcagg 1020
agtaccatga ccccctgtac gaacaggctg gccagcccgc ttcaagccag ggtggggtca 1080
gtgggcacag gtacccaggg gcgggggtgg tgatcaaaca ggagcgcaca gacttcgcct 1140
acgactcaga tgtccctgga tgtgcatcaa tgtacctcca cccagagggc ttctctggac 1200
cctctccagg tgatggagtg atgggttatg gctatgaaaa atcccttcga ccattcccag 1260
atgatgtctg cattgtccct aaaaaatttg aaggagacat caagcaggaa gggattggag 1320
ctttccggga ggggccaccc taccagcgcc ggggtgcctt acaactgtgg cagtttctgg 1380
tggccctgct ggatgaccca acaaatgctc atttcattgc ttggacaggc cggggaatgg 1440
agtttaaact aattgaacct gaagaggttg ccaggctctg gggtatccag aagaaccggc 1500
cagccatgaa ttatgacaag ctgagccgct cgctgcgata ctattatgag aaaggcatca 1560
tgcagaaggt ggctggcgaa cgctacgtgt acaagtttgt gtgcgagccg gaggccctgt 1620
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tctctctggc cttcccagat aatcaacgtc cagctctgaa ggctgagttt gaccggccag 1680 
tcagtgagga ggacacagtc cctttgtccc acttggatga gagtcctgcc tacctcccag 1740 
aactcactgg ccccgctccg cccttcggcc acagaggtgg atattcttac taggcaccag 1800 
tggcttcccc ttgacatggt ggggttgctc agtgtatata tcaactgatt tggtattggt 1860 
gaaggccctc tttctgatgc ctgtagaagt ctctggggtc agagctccac tatcccatct 1920
gatactcctg gccagactca gctgctaacc agagtctgcg ggaaagacag tggaggcagg 1980 
ccaaatctaa aggcagtagc tgaagttcgc tgtggctcac ctgtaccttc agttcagctt 2040 
ggcctctgcc taggtcttgc tcagaggcca agttcctcac ccccaccaca gagatccagt 2100 
gttctattct ggggacatac agggacttcc cttgtttatt atggcaacag ggccaagggg 2160 
attctcagaa caccctgtgt ctcccctctc ccaacccccc atgggagaca aagttctgcc 2220
tggcttctgc cctgaacagg ggggtcctgt gttcttggtg ctgtgctctg ggaggcagga 2280 
gcatgtgggc ggcagctggg ggggggtgtg gaagtagaga tggctctctg ccctaggcct 2340
acccaggcct aattccacct ttgcctctta tgccagacct taataaagcc tctgcttctc 2400 
cccggaattc 2410 


<210> 14 
<211> 32 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 14 

taacaggatc cggcctctca ctaactaatc ac 32

<210> 15 
<211> 31 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 15

caactggatc caagattcaa gatagaagga t 31 

<210> 16 
<211> 29 

<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 16 

ggacaggatc ccccaaactt ttcaggcac 29 

<210> 17 
<211> 29

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 17 

caagcatgca tctagtgtta gtggaagag 29 

<210> 18
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic 
<400> 18 

caagcatgca aatattaact caaaatgga 29 


<210> 19 
<211> 28 
<212> DNA 

<213> Artificial Sequence


<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 19 

caagcatgct gttgtttttt gttttaac 28

<210> 20 
<211> 29 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 20 

caagcatgca gccattcagt cgaggaagg 29 

<210> 21 
<211> 27

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 21 

caagcatgcg atcccttcct tatacct 27 

<210> 22

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic 

<400> 22 

taagcttaac ctttgctagc agattgt 27 

<210> 23 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 23

cttggtacca gccattcagt cgaggaagg 29

<210> 24 
<211> 29 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 24 

cttggtacca tatgaatcct ttcatagat 29

<210> 25
<211> 23
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic

<400> 25

ctgtgggaac acacagattt tgg 23

<210> 26
<211> 19
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic

<400> 26

ggaataattc gaatcacat 19

<210> 27
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic

<400> 27

ccaatctgct cacacaggat 20

<210> 28
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic

<400> 28

ccttgaggct gtccaagtga 20



<210> 29
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic

<400> 29

atggttaact gacttacgaa 20

<210> 30 
<211> 26 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 30 

gctccatttt gagttaatat ttgtgt 26 

<210> 31 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 31 

ttcagtcgag gaaggatagg gt 22 

<210> 32 
<211> 22 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: Synthetic 
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<400> 32 

ttcagtcggt tggtgatagg gt 22 

<210> 33 

<211> 11 

<212> DNA 

<213> Homo sapiens 

<400> 33 

tcgaggaagg a 11 

<210> 34 

<211> 15 

<212> DNA 

<213> Homo sapiens 

<400> 34 

agtcgaggaa ggata 15 

<210> 35 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 35 

tcagtcgagg aaggatagg 19 

<210> 36 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<400> 36 

attcagtcga ggaaggatag ggt 23 

<210> 37 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 37 

ccattcagtc gaggaaggat agggtgg 27 
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<210> 38 
<211> 30 
<212> DNA 
<213> Homo sapiens



<400> 38 

gccattcagt cgaggaagga tagggtggta 30 

<210> 39 
<211> 7 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 39 

gaggatg 7 



<210> 40 
<211> 7 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 40 

caggaag 7 



<210> 41 
<211> 7 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 41 

caggatg 7 

<210> 42 
<211> 12222 
<212> DNA 

<213> Homo sapiens 
<400> 42

gaatcccagg ttggaggggc ggcaacctcc tgccagcctt caggccactc tcctgtgcct 60
gccagaagag acagagcttg aggagagctt gaggagagca ggaaaggtgg aacattgctg 120
ctgctgctca ctcagttcca caggtgggag gaacagcagg gcttagagtg ggggtcattg 180
tgcagatggg aaaacaaagg cccagagagg ggaagaaatg cctaggagct accgagggca 240
ggcgacctca accacagccc agtgctggag ctgtgagtgg atgtagagca gcggaatatc 300
cattcagcca gctcagggga aggacagggg ccctgaagcc aggggatgga gctgcaggga 360
agggagctca gagagaaggg gaggggagtc tgagctcagt ttcccgctgc ctgaaaggag 420
ggtggtacct actcccttca cagggtaact gaatgagaga ctgcctggag gaaagctctt 480
caagtgtggc ccaccccacc ccagtgacac cagcccctga cacgggggag ggagggcagc 540
atcaggaggg gctttctggg cacacccagt acccgtctct gagctttcct tgaactgttg 600
cattttaatc ctcacagcag ctcaacaagg tacataccgt caccatcccc attttacaga 660
tagggaaatt gaggctcgga gcggttaaac aactcacctg aggcctcaca gccagtaagt 720
gggttccctg gtctgaatgt gtgtgctgga ggatcctgtg ggtcactcgc ctggtagagc 780
cccaaggtgg aggcataaat gggactggtg aatgacagaa ggggcaaaaa tgcactcatc 840
cattcactct gcaagtatct acggcacgta cgccagctcc caagcaggtt tgcgggttgc 900
acagcggagc gatgcaatct gatttaggct tttaaaggat tgcaatcaag tgggacccac 960
tagcctcaac cctgtacctc ccctcccctc cacccccagc agtctccaaa ggcctccaac 1020
aaccccagag tgggggccat gtatccaaag aaactccaag ctgtatacgg atcacactgg 1080
ttttccagga gcaaaaacag aaacagcctg aggctggtca aaattgaacc tcctcctgct 1140
ctgagcagcc tagggggcag actaagcaga gggctgtgca gacccacata aagagcctac 1200
tgtgtgccag gcacttcacc cgaggcactt cacaagcatg cttgggaatg aaacttccaa 1260
ctctttggga tgcaggtgaa acagttcctg gttcagagag gtgaagcggc ctgcctgagg 1320
cagcacagct cttctttaca gatgtgcttc cccacctcta ccctgtctca cggcccccca 1380
tgccagcctg acggttgtgt ctgcctcagt catgctccat ttttccatcg ggaccatcaa 1440
gagggtgttt gtgtctaagg ctgactgggt aactttggat gagcggtctc tccgctccga 1500
gcctgtttcc tcatctgtca aacgggctct aacccactct gatctcccag ggcggcagta 1560
agtcttcagc atcaggcatt ttggggtgac tcagtaaatg gtagatcttg ctaccagtgg 1620
aacagccact aaggattctg cagtgagagc agagggccag ctaagtggta ctctcccaga 1680
gactgtctga ctcacgccac cccctccacc ttggacacag gacgctgtgg tttctgagcc 1740
aggtacaatg actcctttcg gtaagtgcag tggaagctgt acactgccca ggcaaagcgt 1800
ccgggcagcg taggcgggcg actcagatcc cagccagtgg acttagcccc tgtttgctcc 1860
tccgataact ggggtgacct tggttaatat tcaccagcag cctcccccgt tgcccctctg 1920
gatccactgc ttaaatacgg acgaggacag ggccctgtct cctcagcttc aggcaccacc 1980
actgacctgg gacagtgaat cgtaagtatg cctttcactg cgaggggttc tggagaggct 2040
tccgagctcc ccatggccca ggcaggcagc aggtctgggg caggaggggg gttgtggagt 2100
gggtatccgc ctgctgaggt gcagggcaga tggagaggct gcagctgagc tcctattttc 2160
ataataacag cagccatgag ggttgtgtcc tgtttcccag tcctgcccgg tcccccctcg 2220
gtacctcctg gtggatacac tggttcctgt aagcagaagt ggatgagggt gtctaggtct 2280
gcagtcctgg caccccagga tgggggacac cagccaagat acagcaacag caacaaagcg 2340
cagccatttc tttctgtttg cacagctcct ctgtctgtcg ggggctcctg tctgttgtct 2400
cctataagcc tcaccacctc tcctactgct tgggcatgca tctttctccc cttctataga 2460
tgaggaggtt aaggttcaga gaggggtggg gaggaacgcc ggctcacatt ctccatcccc 2520
tccagatatg accaggaaca gacctgcgcc agcctcagcc ttacatcaaa atgggcctcc 2580
ccatgcaccg tggacctctg ggccctcctg tcccagtgga ggacaggaag ctgtgagggg 2640
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cactgtcacc cagggctcaa gctggcattc ctgaataabc gctccgcacc aggccacggc 2700 
taagctcagt gcgtgattaa gcctcataac cctccaaggc agttactagt gtgattccca 2760 
ttttacagat gaggaagatg gggacagaga ggtgaataac tggccccaaa tcacacacca 2820
tccataattc gggctcaggc acctggctcc agtccccaaa ctcttgaacc tggccctagt 2880 
gtcactgttt ctcttgggtc tcaggcgctg gatggggaac aggaaacctg ggctgaactt 2940
gaggcctctc tgatgctcgg tgacttcaga cagttgctca acctctctgt tctcttgggc 3000 
aaaacatgat aacctttgac ttctgtcccc tcccctcacc ccacccgacc ttgatctctg 3060 
aagtgttgga aggatttaat ttttcctgca ctgagttttg gagacaggtc aaaaagatga 3120 
ccaaggccaa ggtggccagt ttcctataga acgcctctaa aagacctgca gcaatagcag 3180 

10 caagaactgg tattctcgag aacttgctgc gcagcaggca cttcttggca ttttatgtgt 3240 
atttaatttc acaatagctc tatgacaaag tccacctttc tcatctccag gaaactgagg 3300 
ttcagagagg ttaagtaact tgtccaaggt cacacagcta atagcaagtt gacgtggagc 3360 
aatctggcct cagagccttt aattttagcc acagactgat gctcccctct tcatttagcc 3420 
aggctgcctc tgaagttttc tgattcaaga cttctggctt cagctttgta cacagagatg 3480 

15 attcaatgtc aggttttgga gcgaaatctg tttaatccca gacaaaacat ttaggattac 3540 
atctcagttt tgtaagcaag tagctctgtg atttttagtg agttatttaa tgctctttgg 3600 
ggctcaattt ttctatctat aaaatagggc taataatttg caccttatag ggtaagcttt 3660 
gaggacagat tagatgatac ggtgcctgta aaacaccagg tgttagtaag tgtggcaatg 3720 
atggtgacgc tgaggctgtg tttgcttagc atagggttag gcagctggca ggcagtaaac 3780 

agttggataa tttaatggaa aatttgccaa actcagatgc tgttcactgc tgagcaggag 3840
ccccttcctg ctgaaatggt cctggggagt gcagcaggct ctccgggaag aaatctacca 3900 
tctctcgggc aggagctcaa cctgtgtgca ggtacaggga gggcttcctc acctggtgcc 3960
cactcatgca ttacgtcagt tattcctcat ccctgtccaa aggattcttt tctccattgt 4020 
acagctatga agctagtgct caaagaagtg aagtcattta ccccaggccc cctgccagta 4080 

agtgacaggg cctggtcaca cttgggttta tttattgccc agttcaacag gttgtttgac 4140
cataggcgag attctcttcc ctgcaccctg ccgggttgct cttggtccct tattttatgc 4200
tcctgggtag aaatggtgcg agattaggca gggagtggac gcttccctgt ccctggcccc 4260 
gcaaagagtg ctcccacctg ccccgatccc agaaatgtca ccatgaagcc ttcattcttt 4320 
tggtttaaag cttggcctca gtgtccgtac accatggggt ccttggccag atggcgactt 4380 

tctcctctcc agtcgccctc ccaggcacta gcttttagga gtgcagggtg ctgcctctga 4440
tagaagggcc aggagagagc aggttttgga gacctgatgt tataaggaac agcttgggag 4500 
gcataatgaa cccaacatga tgcttgagac caatgtcaca gcccaattct gacattcatc 4560 
atctgagatc tgaggacaca gctgtctcag ttcatgatct gagtgctggg aaagccaaga 4620 
cttgttccag ctttgtcact gacttgctgt atagcctcaa caaggccctg accctctctg 4680 

ggcttcaaac tcttcactgt gaaaggagga aaccagagta ggtgatgtga caccaggaaa 4740
gatggatggg tgtgggggaa tgtgctcctc ccagctgtca ccccctcgcc accctccctg 4800
caccagcctc tccacctcct ttgagcccag aattcccctg tctaggaggg cacctgtctc 4860 
gtgcctagcc atgggaattc tccatctgtt ttgctacatt gaacccagat gccattctaa 4920 
ccaagaatcc tggctgggtg caggggctct cgcctgtaac cccagcactt tgggaggcca 4980 

aggcaggcgg atcaagaggt caggagttca agacctgcct ggccaacacg gtgaaacctc 5040
agctctacta aaaatacaaa aattagccag gcgtggtggc acacgcctgt aatcccagct 5100 
atttgggaag ctgagacaga agaatttctt gaacccggga ggtggaggtt tcagtgagcc 5160 
gagatcacgc cactgcactc caccctggcg gataaagcga gactctgtct caaaaaaaac 5220 
ccaaaaacct atgttagtgt acagagggcc ccagtgaagt cttctcccag ccccactttg 5280 

cacaactggg gagagtgagg ccccaggacc agaggattct tgctaaaggc caagtggata 5340
gtgatggccc tgccaggcta gaagccacaa cctctggccc tgaggccact cagcatatct 5400
agtgtcccca ccctgcagag gcccaactcc ctcctgacca ctgagccctg taatgacggg 5460 
ggaatttcca taagccatga aggactgcac aaagttcagt tgggagtgaa agagaaatta 5520 
aagggagatg gaaatataca gcactaattt tagcaccgtc ttcagttcta acaacactag 5580 
ctagctgaag aaaatacaaa catgtattat gtaatgtgtg gtctgttcca tttggattac 5640
ttagaggcac gagggccaag gagaaaggtg gtggagagaa accagctttg cacttcattt 5700 
gttgctttat tggaaggaaa cttttaaaag tccaaggggg ttgaagaatc tcaatatttg 5760 
ttatttccag ctttttttct ccagtttttc atttcccaaa ttcaaggaca cctttttctt 5820 
tgtattttgt taagatgatg gttttggttt tgtgactagt agttaacaat gtggctgccg 5880 

ggcatattct cctcagctag gacctcagtt ttcccatctg tgaagacggc aggttctacc 5940
tagggggctg caggcaggtg gtccgaagcc tgggcatatc tggagtagaa ggatcactgt 6000
ggggcagggc aggttctgtg ttgctgtgga tgacgttgac tttgaccatt gctcggcaga 6060 
gcctgctctc gctggttcag ccacaggccc caccactccc tattgtctca gccccgggta 6120
tgaaacatgt attcctcact ggcctatcac ctgaagcctt tgaatttgca acacctgcca 6180 

acccctccct caaaagagtt gccctctcta gatccttttg atgtaaggtt tggtgttgag 6240
acttatttca ctaaattctc atacataaac atcactttat gtatgaggca aaatgaggac 6300 
ccgggagatg aatgacttgt cctggctcat acacctggaa agtgacagag tcagattaga 6360
tcctaggtct atctgaagtt aaaagaggtg tcttttcact tcccacctcc tccatctact 6420 
ttaaagcagc acaaacccct gctttcaagg agagatgagc gtctctaaag cccctgacag 6480 

caagagccca gaactgggac accattagtg acccagacgg caggtaagct gactgcagga 6540
gcatcagcct attcttgtgt ctgggaccac agagcattgt ggggacagcc ccgtctcttg 6600 
ggaaaaaaac cctaagggct gaggatcctt gtgagtgttg ggtgggaaca gctcccagga 6660 
ggtttaatca cagcccctcc atgctctcta gctgttgcca ttgtgcaaga tgcatttccc 6720 
ttctgtgcag cagtttccct ggccactaaa tagtgggatt agatagaagc cctccaaggg 6780 

ctccagcttg acatgattct tgattctgat ctgacccgat tctgataatc gtgggcaggc 6840
ccattcctct tcttgtgcct cattttcttc ttttgtaaaa caatggctgt accatttgca 6900 
tcttagggtc attgcagatg aaagtgttgc tgtccagagc ctgggtgcag gacctagatg 6960
taggattctg gttctgctac ttcctcagtg acattgaata gctgacctaa tctctctggc 7020 
tttggtttct tcatctgtaa aagaaggata ttagcattag cacctcacgg gattgttaca 7080 

agaaagcaat gaattaacac atgtgagcac ggagaacagt gcttggcata tggtaagcac 7140
tacgtacatt ttgctattct tctgattctt tcagtgttac tgatgtcggc aagtacttgg 7200 
cacaggctgg tttaataatc cctaggcact ttcacgtggt gtcaatccct gatcactggg 7260 
agtcatcatg tgccttgact cgggcctggc ccccccatct ctgtcttgca ggacaatgcc 7320 
gtcttctgtc tcgtggggca tcctcctgct ggcaggcctg tgctgcctgg tccctgtctc 7380 

cctggctgag gatccccagg gagatgctgc ccagaagaca gatacatccc accatgatca 7440
ggatcaccca accttcaaca agatcacccc caacctggct gagttcgcct tcagcctata 7500 
ccgccagctg gcacaccagt ccaacagcac caatatcttc ttctccccag tgagcatcgc 7560 
tacagccctt gcaatgctct ccctggggac caaggctgac actcacgatg aaatcctgga 7620
gggcctgaat ttcaacctca cggagattcc ggaggctcag atccatgaag gcttccagga 7680 

actcctccgt accctcaacc agccagacag ccagctccag ctgaccaccg gcaatggcct 7740
gttcctcagc gagggcctga agctagtgga taagtttttg gaggatgtta aaaagttgta 7800 
ccactcagaa gccttcactg tcaacttcgg ggacaccgaa gaggccaaga aacagatcaa 7860 
cgattacgtg gagaagggta ctcaagggaa aattgtggat ttggtcaagg agcttgacag 7920 
agacacagtt tttgctctgg tgaattacat cttctttaaa ggtaaggttg ctcaaccagc 7980 

ctgagctgtt tcccatagaa acaagcaaaa atatttctca aaccatcagt tcttgaactc 8040
tccttggcaa tgcattatgg gccatagcaa tgcttttcag cgtggattct tcagttttct 8100
acacacaaac actaaaatgt tttccatcat tgagtaattt gaggaaataa tagattaaac 8160
tgtcaaaact actgacgctc tgcagaactt ttcagagcct ttaatgtcct tgtgtatact 8220
gtatatgtag aacatataat gcttagaact atagaacaaa ttgtaataca ctgcataaag 8280
ggatagtttc atggaacata ctttacacga ctctagtgtc ccagaatcag tatcagtttt 8340
gcaatctgaa agacctgggt tcaaatcctg cctctaacac aattagcttt tgacaaaaac 8400
aatgcattct acctctttga ggtgctaatt tctcatctta gcatggacaa aataccattc 8460
ttgctgtcag gtttttttag gattaaacaa atgacaaaga ctgtggggat ggtgtgtggc 8520
atacagcagg tgatggactc ttctgtatct caggctgcct tcctgcccct gaggggttaa 8580
aatgccaggg tcctgggggc cccagggcat tctaagccag ctcccactgt cccaggaaaa 8640
cagcataggg gaggggaggt gggaggcaag gccaggggct gcttcctcca ctctgaggct 8700
cccttgctct tgaggcaaag gagggcagtg gaggcaagcc aggctgcagt cagcacagct 8760
aaagtcctgg ctctgctgtg gccttagtgg gggcccaggt ccctctccag ccccagtctc 8820
ctccttctgt ccaatgagaa agctgggatc aggggtccct gaggcccctg tccactctgc 8880
atgcctcgat ggtgaagctc tgttggtatg gcagagggga ggctgctcag gcatctgcat 8940
ttcccctgcc aatctagagg atgaggaaag ctctcaggaa tagtaagcag aatgtttgcc 9000
ctggatgaat aactgagctg ccaattaaca aggggcaggg agccttagac agaaggtacc 9060
aaatatgcct gatgctccaa cattttattt gtaatatcca agacaccctc aaataaacat 9120
atgattccaa taaaaatgca cagccacgat ggcatctctt agcctgacat cgccacgatg 9180
tagaaattct gcatcttcct ctagttttga attatcccca cacaatcttt ttcggcagct 9240
tggatggtca gtttcagcac cttttacaga tgatgaagct gagcctcgag ggatgtgtgt 9300
cgtcaagggg gctcagggct tctcagggag gggactcatg gtttcttatt ctgctacact 9360
cttccaaacc ttcactcacc cctggtgatg cccaccttcc cctctctcca ggcaaatggg 9420
agagaccctt tgaagtcaag gacaccgagg aagaggactt ccacgtggac caggtgacca 9480
ccgtgaaggt gcctatgatg aagcgtttag gcatgtttaa catccagcac tgtaagaagc 9540
tgtccagctg ggtgctgctg atgaaatacc tgggcaatgc caccgccatc ttcttcctgc 9600
ctgatgaggg gaaactacag cacctggtaa atgaactcac ccacgatatc atcaccaagt 9660
tcctggaaaa tgaagacaga aggtgattcc ccaacctgag ggtgaccaag aagctgccca 9720
cacctcttag ccatgttggg actgaggccc atcaggactg gccagagggc tgaggagggt 9780
gaaccccaca tccctgggtc actgctactc tgtataaact tggcttccag aatgaggcca 9840
ccactgagtt caggcagcgc cgtccatgct ccatgaggag aacagtaccc agggtgagga 9900
ggtaaaggtc tcgtccctgg gaacttccca ctccagtgtg gacactgtcc cttcccaata 9960
tccagtgccc aaggcaggga cagcagcacc accacacgtt ctggcagaac caaaaaggaa 10020
cagatgggct tcctggcaaa ggcagcagtg gagtgtggag ttcaagggta gaatgtccct 10080
ggggggacgg gggaagagcc tgtgtggcaa ggcccagaaa agcaaggttc ggaattggaa 10140
cagccaggcc atgttcgcag aaggcttgcg tttctctgtc actttatcgg tgctgttaga 10200
ttgggtgtcc tgtagtaagt gatacttaaa catgagccac acattagtgt atgtgtgtgc 10260
attcgtgatt atgcccatgc cctgctgatc tagttcgttt tgtacactgt aaaaccaaga 10320
tgaaaataca aaaggtgtcg ggttcataat aggaatcgag gctggaattt ctctgttcca 10380
tgccagcacc tcctgaggtc tctgctccag gggttgagaa agaacaaaga ggctgagagg 10440
gtaacggatc agagagccca gagccagctg ccgctcacac cagaccctgc tcagggtggc 10500
attgtctccc catggaaaac cagagaggag cactcagcct ggtgtggtca ctcttctctt 10560
atccactaaa cggttgtcac tgggcactgc caccagcccc gtgtttctct gggtgtaggg 10620
ccctggggat gttacaggct gggggccagg tgacccaaca ctacagggca agatgagaca 10680
ggcttccagg acacctagaa tatcagagga ggtggcattt caagcttttg tgattcattc 10740
gatgttaaca ttctttgact caatgtagaa gagctaaaag tagaacaaac caaagccgag 10800
ttcccatctt agtgtgggtg gaggacacag gagtaagtgg cagaaataat cagaaaagaa 10860
aacacttgca ctgtggtggg tcccagaaga acaagaggaa tgctgtgcca tgccttgaat 10920
ttcttttctg cacgacaggt ctgccagctt acatttaccc aaactgtcca ttactggaac 10980
ctatgatctg aagagcgtcc tgggtcaact gggcatcact aaggtcttca gcaatggggc 11040
tgacctctcc ggggtcacag aggaggcacc cctgaagctc tccaaggtga gatcaccctg 11100
acgaccttgt tgcaccatgg tatctgtagg gaagaatgtg tgggggctgc agcactgtcc 11160
tgaggctgag gaaggggccg agggaaacaa atgaagaccc aggctgagct cctgaagatg 11220
cccgtgattc actgacacgg gacggtgggc aaacagcaaa gccaggcagg ggctgctgtg 11280
cagctggcac tttcggggcc tcccttgagg ttgtgtcact gaccctgaat ttcaactttg 11340
cccaagacct tctagacatt gggccttgat ttatccatac tgacacagaa aggtttgggc 11400
taagttgttt caaaggaatt tctgactcct tcgatctgtg agatttggtg tctgaattaa 11460
tgaatgattt cagctaaagt gacacttatt ttggaaaact aaaggcgacc aatgaacaac 11520
ctgcagttcc atgaatggct gcattatctt ggggtctggg cactgtgaag gtcactgcca 11580
gggtccgtgt cctcaaggag cttcaagccg tgtactagaa aggagagagc cctggaggca 11640
gacgtggagt gacgatgctc ttccctgttc tgagttgtgg gtgcacctga gcagggggag 11700
^ggcgcttgt caggaagatg gacagagggg agccagcccc atcagccaaa gccttgagga 11760
ggagcaaggc ctatgtgaca gggagggaga ggatgtgcag ggccagggcc gtccaggggg 11820
agtgagcgct tcctgggagg tgtccacgtg agccttgctc gaggcctggg atcagcctta 11880
caacgtgtct ctgcttctct cccctccagg ccgtgcataa ggctgtgctg accatcgacg 11940
agaaagggac tgaagctgct ggggccatgt ttttagaggc catacccatg tctatccccc 12000
ccgaggtcaa gttcaacaaa ccctttgtct tcttaatgat tgaacaaaat accaagtctc 12060
ccctcttcat gggaaaagtg gtgaatccca cccaaaaata actgcctctc gctcctcaac 12120
ccctcccctc catccctggc cccctccctg gatgacatta aagaagggtt gagctggtcc 12180
ctgcctgcat gtgatctgta aatccctggg atgttttctc tg 12222



<210> 43 
<211> 1599 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 43 

caccagcatc atctcctcca attcatccag ctactctgcc catgaagata atagttttca 60
ggcggattgc ctcagatcac actatctcca cttgcccagc cctgtggaag attagcggcc 120
atgtattcca atgtgatagg aactgtaacc tctggaaaaa ggaaggttta tcttttgtcc 180
ttgctgctca ttggcttctg ggactgcgtg acctgtcacg ggagccctgt ggacatctgc 240
acagccaagc cgcgggacat tcccatgaat cccatgtgca tttaccgctc cccggagaag 300
aaggcaactg aggatgaggg ctcagaacag aagatcccgg aggccaccaa ccggcgtgtc 360
tgggaactgt ccaaggccaa ttcccgcttt gctaccactt tctatcagca cctggcagat 420
tccaagaatg acaatgataa cattttcctg tcacccctga gtatctccac ggcttttgct 480
atgaccaagc tgggtgcctg taatgacacc ctccagcaac tgatggaggt atttaagttt 540
gacaccatat ctgagaaaac atctgatcag atccacttct tctttgccaa actgaactgc 600
cgactctatc gaaaagccaa caaatcctcc aagttagtat cagccaatcg cctttttgga 660
gacaaatccc ttaccttcaa tgagacctac caggacatca gtgagttggt atatggagcc 720
aagctccagc ccctggactt caaggaaaat gcagagcaat ccagagcggc catcaacaaa 780
tgggtgtcca ataagaccga aggccgaatc accgatgtca ttccctcgga agccatcaat 840
gagctcactg ttctggtgct ggttaacacc atttacttca agggcctgtg gaagtcaaag 900
ttcagccctg agaacacaag gaaggaactg ttctacaagg ctgatggaga gtcgtgttca 960
gcatctatga tgtaccagga aggcaagttc cgttatcggc gcgtggctga aggcacccag 1020
gtgcttgagt tgcccttcaa aggtgatgac atcaccatgg tcctcatctt gcccaagcct 1080
gagaagagcc tggccaaggt ggagaaggaa ctcaccccag aggtgctgca ggagtggctg 1140

gatgaattgg aggagatgat gctggtggtt cacatgcccc gcttccgcat tgaggacggc 1200
ttcagtttga aggagcagct gcaagacatg ggccttgtcg atctgttcag ccctgaaaag 1260 
tccaaactcc caggtattgt tgcagaaggc cgagatgacc tctatgtctc agatgcattc 1320 
cataaggcat ttcttgaggt aaatgaagaa ggcagtgaag cagctgcaag taccgctgtt 1380 
gtgattgctg gccgttcgct aaaccccaac agggtgactt tcaaggccaa caggcccttc 1440 

ctggttttta taagagaagt tcctctgaac actattatct tcatgggcag agtagccaac 1500
ccttgtgtta agtaaaatgt tcttattctt tgcacctctt cctatttttg gtttgtgaac 1560 
agaagtaaaa ataaatacaa actacttcca tctcacatt 1599 

<210> 44
<211> 23
<212> DNA
<213> T7 virus

<400> 44

taatacgact cactataggg cga 23

<210> 45
<211> 24
<212> DNA
<213> T3 virus

<400> 45

ttattaaccc tcactaaagg gaag 24

<210> 46
<211> 23
<212> DNA
<213> SP6 virus

<400> 46

atttaggtga cactatagaa tac 23






WO 00/75279
PCT/US00/15728



<210> 47 
<211> 555 
<212> PRT 

<213> Mus musculus 


<400> 47 

Met Thr Lys Ser Ser Asn His Asn Cys Leu Leu Arg Pro Glu Asn Lys
1 5 10 15

Pro Gly Leu Trp Gly Pro Gly Ala Gln Ala Ala Ser Leu Arg Pro Ser
20 25 30

Pro Ala Thr Leu Val Val Ser Ser Pro Gly His Ala Glu His Pro Pro
35 40 45

Ala Ala Pro Ala Gln Thr Pro Gly Pro Gln Val Ser Ala Ser Ala Arg
50 55 60

Gly Pro Gly Pro Val Ala Gly Gly Ser Gly Arg Met Glu Arg Arg Met
65 70 75 80

Lys Gly Gly Tyr Leu Asp Gln Arg Val Pro Tyr Thr Phe Cys Ser Lys
85 90 95

Ser Pro Gly Asn Gly Ser Leu Gly Glu Ala Leu Met Val Pro Gln Gly
100 105 110



Lys Leu Met Asp Pro Gly Ser Leu Pro Pro Ser Asp Ser Glu Asp Leu
115 120 125

Phe Gln Asp Leu Ser His Phe Gln Glu Thr Trp Leu Ala Glu Ala Gln
130 135 140

Val Pro Asp Ser Asp Glu Gln Phe Val Pro Asp Phe His Ser Glu Asn
145 150 155 160

Leu Ala Phe His Ser Pro Thr Thr Arg Ile Lys Lys Glu Pro Gln Ser
165 170 175

Pro Arg Thr Asp Pro Ala Leu Ser Cys Ser Arg Lys Pro Pro Leu Pro
180 185 190



Tyr His His Gly Glu Gln Cys Leu Tyr Ser Arg Gln Ile Ala Ile Lys
195 200 205

Ser Pro Ala Pro Gly Ala Pro Gly Gln Ser Pro Leu Gln Pro Phe Ser
210 215 220

Arg Ala Glu Gln Gln Gln Ser Leu Leu Arg Ala Ser Ser Ser Ser Gln
225 230 235 240

Ser His Pro Gly His Gly Tyr Leu Gly Glu His Ser Ser Val Phe Gln
245 250 255

Gln Pro Val Asp Met Cys His Ser Phe Thr Ser Pro Gln Gly Gly Gly
260 265 270

Arg Glu Pro Leu Pro Ala Pro Tyr Gln His Gln Leu Ser Glu Pro Cys
275 280 285

Pro Pro Tyr Pro Gln Gln Asn Phe Lys Gln Glu Tyr His Asp Pro Leu
290 295 300

Tyr Glu Gln Ala Gly Gln Pro Ala Ser Ser Gln Gly Gly Val Ser Gly
305 310 315 320

His Arg Tyr Pro Gly Ala Gly Val Val Ile Lys Gln Glu Arg Thr Asp
325 330 335

Phe Ala Tyr Asp Ser Asp Val Pro Gly Cys Ala Ser Met Tyr Leu His
340 345 350

Pro Glu Gly Phe Ser Gly Pro Ser Pro Gly Asp Gly Val Met Gly Tyr
355 360 365

Gly Tyr Glu Lys Ser Leu Arg Pro Phe Pro Asp Asp Val Cys Ile Val
370 375 380

Pro Lys Lys Phe Glu Gly Asp Ile Lys Gln Glu Gly Ile Gly Ala Phe
385 390 395 400

Arg Glu Gly Pro Pro Tyr Gln Arg Arg Gly Ala Leu Gln Leu Trp Gln
405 410 415

Phe Leu Val Ala Leu Leu Asp Asp Pro Thr Asn Ala His Phe Ile Ala
420 425 430

Trp Thr Gly Arg Gly Met Glu Phe Lys Leu Ile Glu Pro Glu Glu Val
435 440 445

Ala Arg Leu Trp Gly Ile Gln Lys Asn Arg Pro Ala Met Asn Tyr Asp
450 455 460

Lys Leu Ser Arg Ser Leu Arg Tyr Tyr Tyr Glu Lys Gly Ile Met Gln
465 470 475 480

Lys Val Ala Gly Glu Arg Tyr Val Tyr Lys Phe Val Cys Glu Pro Glu
485 490 495

Ala Leu Phe Ser Leu Ala Phe Pro Asp Asn Gln Arg Pro Ala Leu Lys
500 505 510

Ala Glu Phe Asp Arg Pro Val Ser Glu Glu Asp Thr Val Pro Leu Ser
515 520 525

His Leu Asp Glu Ser Pro Ala Tyr Leu Pro Glu Leu Thr Gly Pro Ala
530 535 540

Pro Pro Phe Gly His Arg Gly Gly Tyr Ser Tyr
545 550 555



<210> 48 

<211> 8 

<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 48 
cccggaag 

<210> 49 

<211> 1843 

<212> DNA 

<213> Homo sapiens 



<400> 49 

ctgcaggggg gggggggggg gggggctgtc atggcggcag gacggcgaac ttgcagtatc 60
tccacgaccc gcccctacag gtgccagtgc ctccagaatg tggcagctca caagcctcct 120
gctgttcgtg gccacctggg gaatttccgg cacaccagct cctcttgact cagtgttctc 180
cagcagcgag cgtgcccacc aggtgctgcg gatccgcaaa cgtgccaact ccttcctgga 240
ggagctccgt cacagcagcc tggagcggga gtgcatagag gagatctgtg acttcgagga 300
ggccaaggaa attttccaaa atgtggatga cacactggcc ttctggtcca agcacgtcga 360
cggtgaccag tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca 420
cggcacgtgc atcgacggca tcggcagctt cagctgcgac tgccgcagcg gctgggaggg 480
ccgcttctgc cagcgcgagg tgagcttcct caattgctcg ctggacaacg gcggctgcac 540
gcattactgc ctagaggagg tgggctggcg gcgctgtagc tgtgcgcctg gctacaagct 600
gggggacgac ctcctgcagt gtcaccccgc agtgaagttc ccttgtggga ggccctggaa 660
gcggatggag aagaagcgca gtcacctgaa acgagacaca gaagaccaag aagaccaagt 720 
agatccgcgg ctcattgatg ggaagatgac caggcgggga gacagcccct ggcaggtggt 780 
cctgctggac tcaaagaaga agctggcctg cggggcagtg ctcatccacc cctcctgggt 840 
gctgacagcg gcccactgca tggatgagtc caagaagctc cttgtcaggc ttggagagta 900 

tgacctgcgg cgctgggaga agtgggagct ggacctggac atcaaggagg tcttcgtcca 960
ccccaactac agcaagagca ccaccgacaa tgacatcgca ctgctgcacc tggcccagcc 1020 
cgccaccctc tcgcagacca tagtgcccat ctgcctcccg gacagcggcc ttgcagagcg 1080 
cgagctcaat caggccggcc aggagaccct cgtgacgggc tggggctacc acagcagccg 1140 
agagaaggag gccaagagaa accgcacctt cgtcctcaac ttcatcaaga ttcccgtggt 1200 

cccgcacaat gagtgcagcg aggtcatgag caacatggtg tctgagaaca tgctgtgtgc 1260
gggcatcctc ggggaccggc aggatgcctg cgagggcgac agtggggggc ccatggtcgc 1320
ctccttccac ggcacctggt tcctggtggg cctggtgagc tggggtgagg gctgtgggct 1380
ccttcacaac tacggcgttt acaccaaagt cagccgctac ctcgactgga tccatgggca 1440
catcagagac aaggaagccc cccagaagag ctgggcacct tagcgaccct ccctgcaggg 1500 

ctgggctttt gcatggcaat ggatgggaca ttaaagggac atgtaacaag cacaccggcc 1560
tgctgttctg tccttccatc cctcttttgg gctcttctgg agggaagtaa catttactga 1620 
gcacctgttg tatgtcacat gccttatgaa tagaatccta actcctagag caactctgtg 1680 
gggtggggag gagcagatcc aagttttgcg gggtctaaag ctgtgtgtgt tgagggggat 1740 
actctgttta tgaaaaagaa taaaaaacac aaccacgaaa aaaaaaaaaa aaaaaaaaaa 1800 

aaaaaaaaaa aaaaaaaccc ccccccgccc cccccccctg cag 1843

<210> 50 

<211> 11725 

<212> DNA 

<213> Homo sapiens

<400> 50 

agtgaatctg ggcgagtaac acaaaacttg agtgtcctta cctgaaaaat agaggttaga 60 

gggatgctat gtgccattgt gtgtgtgtgt tgggggtggg gattgggggt gatttgtgag 120 

caattggagg tgagggtgga gcccagtgcc cagcacctat gcactgggga cccaaaaagg 180

agcatcttct catgatttta tgtatcagaa attgggatgg catgtcattg ggacagcgtc 240 

ttttttcttg tatggtggca cataaataca tgtgtcttat aattaatggt attttagatt 300 

tgacgaaata tggaatatta cctgttgtgc tgatcttggg caaactataa tatctctggg 360 

caaaaatgtc cccatctgaa aaacagggac aacgttcctc cctcagccag ccactatggg 420 

gctaaaatga gaccacatct gtcaagggtt ttgccctcac ctccctccct gctggatggc 480

atccttggta ggcagaggtg ggcttcgggc agaacaagcc gtgctgagct aggaccagga 540 

gtgctagtgc cactgtttgt ctatggagag ggaggcctca gtgctgaggg ccaagcaaat 600 

atttgtggtt atggattaac tcgaactcca ggctgtcatg gcggcaggac ggcgaacttg 660 

cagtatctcc acgacccgcc cctgtgagtc cccctccagg caggtctatg aggggtgtgg 720 

agggagggct gcccccggga gaagagagct aggtggtgat gagggctgaa tcctccagcc 780
agggtgctca acaagcctga gcttggggta aaaggacaca aggccctcca caggccaggc 840
ctggcagcca cagtctcagg tccctttgcc atgcgcctcc ctctttccag gccaagggtc 900
cccaggccca gggccattcc aacagacagt ttggagccca ggaccctcca ttctccccac 960
cccacttcca cctttggggg tgtcggattt gaacaaatct cagaagcggc ctcagaggga 1020
gtcggcaaga atggagagca gggtccggta gggtgtgcag aggccacgtg gcctatccac 1080
tggggagggt tccttgatcc ctggccacca gggctatctc tgtggccttt tggagcaacc 1140
tggtggtttg gggcaggggt tgaatttcca ggcctaaaac cacacaggcc tggccttgag 1200
tcccggctct gcgagtaatg catggatgta aacatggaga cccaggacct tgcctcagtc 1260
ttccgagtct ggtgcctgca gtgtactgat ggtgtgagac cctactcctg gaggatgggg 1320
gacagaatct gatcgatccc ctgggttggt gacttccctg tgcaatcaac ggagaccagc 1380
aagggttgga tttttaataa accacttaac tcctccgagt ctcagtttcc ccctctatga 1440
aatggggttg acagcattaa taactacctc ttgggtggtt gtgagcctta actgaagtca 1500
taatatctca tgtttactga gcatgagcta tgtgcaaagc ctgttttgag agctttatgt 1560
ggactaactc ctttaattct cacaacaccc tttaaggcac agatacacca cgttattcca 1620
tccattttac aaatgaggaa actgaggcat ggagcagtta agcatcttgc ccaacattgc 1680
cctccagtaa gtgctggagc tggaatttgc accgtgcagt ctggcttcat ggcctgccct 1740
gtgaatcctg taaaaattgt ttgaaagaca ccatgagtgt ccaatcaacg ttagctaata 1800
ttctcagccc agtcatcaga ccggcagagg cagccacccc actgtcccca gggaggacac 1860
aaacatcctg gcaccctctc cactgcattc tggagctgct ttctaggcag gcagtgtgag 1920
ctcagcccca cgtagagcgg gcagccgagg ccttctgagg ctatgtctct agcgaacaag 1980
gaccctcaat tccagcttcc gcctgacggc cagcacacag ggacagccct ttcattccgc 2040
ttccacctgg gggtgcaggc agagcagcag cgggggtagc actgcccgga gctcagaagt 2100
cctcctcaga caggtgccag tgcctccaga atgtggcagc tcacaagcct cctgctgttc 2160
gtggccacct ggggaatttc cggcacacca gctcctcttg gtaaggccac cccaccccta 2220
ccccgggacc cttgtggcct ctacaaggcc ctggtggcat ctgcccaggc cttcacagct 2280
tccaccatct ctctgagccc tgggtgaggt gaggggcaga tgggaatggc aggaatcaac 2340
tgacaagtcc caggtaggcc agctgccaga gtgccacaca ggggctgcca gggcaggcat 2400
gcgtgatggc agggagcccc gcgatgacct cctaaagctc cctcctccac acggggatgg 2460
tcacagagtc ccctgggcct tccctctcca cccactcact ccctcaactg tgaagacccc 2520
aggcccaggc taccgtccac actatccagc acagcctccc ctactcaaat gcacactggc 2580
ctcatggctg ccctgcccca acccctttcc tggtctccac agccaacggg aggaggccat 2640
gattcttggg gaggtccgca ggcacatggg cccctaaagc cacaccaggc tgttggtttc 2700
atttgtgcct ttatagagct gtttatctgc ttgggacctg cacctccacc ctttcccaag 2760
gtgccctcag ctcaggcata ccctcctcta ggatgccttt tcccccatcc cttcttgctc 2820
acacccccaa cttgatctct ccctcctaac tgtgccctgc accaagacag acacttcaca 2880
gagcccagga cacacctggg gacccttcct gggtgatagg tctgtctatc ctccaggtgt 2940
ccctgcccaa ggggagaagc atggggaata cttggttggg ggaggaaagg aagactgggg 3000
ggatgtgtca agatggggct gcatgtggtg tactggcaga agagtgagag gatttaactt 3060
ggcagccttt acagcagcag ccagggcttg agtacttatc tctgggccag gctgtattgg 3120
atgttttaca tgacggtctc atccccatgt ttttggatga gtaaattgaa ccttagaaag 3180
gtaaagacac tggctcaagg tcacacagag atcggggtgg ggttcacagg gaggcctgtc 3240
catctcagag caaggcttcg tcctccaact gccatctgct tcctggggag gaaaagagca 3300
gaggacccct gcgccaagcc atgacctaga attagaatga gtcttgaggg ggcggagaca 3360
agaccttccc aggctctccc agctctgctt cctcagaccc cctcatggcc ccagcccctc 3420
ttaggcccct caccaaggtg agctcccctc cctccaaaac cagactcagt gttctccagc 3480
agcgagcgtg cccaccaggt gctgcggatc cgcaaacgtg ccaactcctt cctggaggag 3540
ctccgtcaca gcagcctgga gcgggagtgc atagaggaga tctgtgactt cgaggaggcc 3600
aaggaaattt tccaaaatgt ggatgacaca gtaaggccac catgggtcca gaggatgagg 3660
ctcaggggcg agctggtaac cagcaggggc ctcgaggagc aggtggggac tcaatgctga 3720
ggccctctta ggagttgtgg gggtggctga gtggagcgat taggatgctg gccctatgat 3780
gtcggccagg cacatgtgac tgcaagaaac agaattcagg aagaagctcc aggaaagagt 3840
gtggggtgac cctaggtggg gactcccaca gccacagtgt aggtggttca gtccaccctc 3900
cagccactgc tgagcaccac tgcctccccg tcccacctca caaagagggg acctaaagac 3960
caccctgctt ccacccatgc ctctgctgat cagggtgtgt gtgtgaccga aactcacttc 4020
tgtccacata aaatcgctca ctctgtgcct cacatcaaag ggagaaaatc tgattgttca 4080
gggggtcgga agacagggtc tgtgtcctat ttgtctaagg gtcagagtcc tttggagccc 4140
ccagagtcct gtggacgtgg ccctaggtag tagggtgagc ttggtaacgg ggctggcttc 4200
ctgagacaag gctcagaccc gctctgtccc tggggatcgc ttcagccacc aggacctgaa 4260
aattgtgcac gcctgggccc ccttccaagg catccaggga tgctttccag tggaggcttt 4320
cagggcagga gaccctctgg cctgcaccct ctcttgccct cagcctccac ctccttgact 4380
ggacccccat ctggacctcc atccccacca cctctttccc cagtggcctc cctggcagac 4440
accacagtga ctttctgcag gcacatatct gatcacatca agtccccacc gtgctcccac 4500
ctcacccatg gtctctcagc cccagcagcc ttggctggcc tctctgatgg agcaggcatc 4560
aggcacaggc cgtgggtctc aacgtgggct gggtggtcct ggaccagcag cagccgccgc 4620
agcagcaacc ctggtacctg gttaggaacg cagaccctct gcccccatcc tcccaactct 4680
gaaaaacact ggcttaggga aaggcgcgat gctcaggggt cccccaaagc ccgcaggcag 4740
agggagtgat gggactggaa ggaggccgag tgacttggtg agggattcgg gtcccttgca 4800
tgcagaggct gctgtgggag cggacagtcg cgagagcagc actgcagctg catggggaga 4860
gggtgttgct ccagggacgt gggatggagg ctgggcgcgg gcgggtggcg ctggagggcg 4920
ggggaggggc agggagcacc agctcctagc agccaacgac catcgggcgt cgatccctgt 4980
ttgtctggaa gccctcccct cccctgcccg ctcacccgct gccctgcccc acccgggcgc 5040
gcccctccgc acaccggctg caggagcctg acgctgcccg ctctctccgc agctggcctt 5100
ctggtccaag cacgtcggtg agtgcgttct agatccccgg ctggactacc ggcgcccgcg 5160
cccctcggga tctctggccg ctgaccccct accccgcctt gtgtcgcaga cggtgaccag 5220
tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca cggcacgtgc 5280
atcgacggca tcggcagctt cagctgcgac tgccgcagcg gctgggaggg ccgcttctgc 5340
cagcgcggtg agggggagag gtggatgctg gcgggcggcg gggcggggct ggggccgggt 5400
tgggsrgcgcg gcaccagcac cagctgcccg cgccctcccc tgcccgcaga ggtgagcttc 5460
ctcaattgct ctctggacaa cggcggctgc acgcattact gcctagagga ggtgggctgg 5520
cggcgctgta gctgtgcgcc tggctacaag ctgggggacg acctcctgca gtgtcacccc 5580
gcaggtgaga agcccccaat acatcgccca ggaatcacgc tgggtgcggg gtgggcaggc 5640
ccctgacggg cgcggcgcgg ggggctcagg agggtttcta gggagggagc gaggaacaga 5700
gttgagcctt ggggcagcgg cagacgcgcc caacaccggg gccactgtta gcgcaatcag 5760
cccgggagct gggcgcgccc tccgctttcc ctgcttcctt tcttcctggc gtccccgctt 5820
cctccgggcg cccctgcgac ctggggccac ctcctggagc gcaagcccag tggtggctcc 5880
gctccccagt ctgagcgtat ctggggcgag gcgtgcagcg tcctcctcca tgtagcctgg 5940
ctgcgttttt ctctgacgtt gtccggcgtg catcgcattt ccctctttac ccccttgctt 6000
ccttgaggag agaacagaat cccgattctg ccttcttcta tattttcctt tttatgcatt 6060
ttaatcaaat ttatatatgt atgaaacttt aaaaatcaga gttttacaac tcttacactt 6120
tcagcatgct gttccttggc atgggccctt ttttcattca ttttcataaa aggtggaccc 6180
ttttaatgtg gaaattccta tcttctgcct ctagggcatt tatcacttat ttcttctaca 6240

atctcccctt tacttcctct attttctctt tctggacctc ccattattca gacctctttc 6300 

ctctagtttt attgtctctt ctatttccca tctctttgac tttgtgtttt ctttcaggga 6360 

actttctttt ttttcttttt ttttgagatg gagtttcact cttgttgtcc caggctggag 6420 

tgcaatgacg tgatctcagc tcaccacaac ctccgcctcc tggattcaag cgattctcct 6480

gccgcagcct cccgagtagc tgggattaca ggcatgcgcc accacgccca gctaattttg 6540 

tgtttttagt agagaagggg tttctccgtg ttggtcaagc tggtcttgaa ctcctgacct 6600 

caggtgatcc acctgccttg gcctcctaaa gtgctgggat tacaggcgtg agccaccgcg 6660 

cccagcctct ttcagggaac tttctacaac tttataattc aattcttctg cagaaaaaaa 6720 

tttttggcca ggctcagtag ctcagaccaa taattccagc actttgagag gctgaggtgg 6780

gaggattgct tgagcttggg agttcgagac tagcctgggc aacacagtga gaccctgtct 6840

ctatttttaa aaaaagtaaa aaaagatcta aaaatttaac tttttatttt gaaataatta 6900 

gatatttcca ggaagctgca aagaaatgcc tggtgggcct gttggctgtg ggtttcctgc 6960 

aaggccgtgg gaaggccctg tcattggcag aaccccagat cgtgagggct ttccttttag 7020 

gctgctttct aagaggactc ctccaagctc ttggaggatg gaagacgctc acccatggtg 7080

ttcggcccct cagagcaggg tggggcaggg gagctggtgc ctgtgcaggc tgtggacatt 7140 

tgcatgactc cctgtggtca gctaagagca ccactccttc ctgaagcggg gcctgaagtc 7200 

cctagtcaga gcctctggtt caccttctgc aggcagggag aggggagtca agtcagtgag 7260 

gagggctttc gcagtttctc ttacaaactc tcaacatgcc ctcccacctg cactgccttc 7320 

ctggaagccc cacagcctcc tatggttccg tggtccagtc cttcagcttc tgggcgcccc 7380

catcacgggc tgagattttt gctttccagt ctgccaagtc agttactgtg tccatccatc 7440 

tgctgtcagc ttctggaatt gttgctgttg tgccctttcc attcttttgt tatgatgcag 7500 

ctcccctgct gacgacgtcc cattgctctt ttaagtctag atatctggac tgggcattca 7560 

aggcccattt tgagcagagt cgggctgacc tttcagccct cagttctcca tggagtatgc 7620 

gctctcttct tggcagggag gcctcacaaa catgccatgc ctattgtagc agctctccaa 7680

gaatgctcac ctccttctcc ctgtaattgc tttcctctgt gaggagctca gcagcatccc 7740

attatgagac cttactaatc ccagggatca cccccaacag ccctggggta caatgagctt 7800 

ttaagaagtt taaccaccta tgtaaggaga cacaggcagt gggcgatgct gcctggcctg 7860 

actcttgcca ttgggtggta ctgtttgttg actgactgac tgactgactg gagggggttt 7920

gtaatttgta tctcagggat tacccccaac agccctgggg tacaatgagc cttcaagaag 7980

tttaacaacc tatgtaagga cacacagcca gtgggtgatg ctgcctggtc tgactcttgc 8040 

cattcagtgg cactgtttgt tgactgactg actgactgac tggctgactg gagggggttc 8100 

atagctaata ttaatggagt ggtctaagta tcattggttc cttgaaccct gcactgtggc 8160 

aaagtggccc acaggctgga ggaggaccaa gacaggaggg cagtctcggg aggagtgcct 8220 

ggcaggcccc tcaccacctc tgcctacctc agtgaagttc ccttgtggga ggccctggaa 8280

gcggatggag aagaagcgca gtcacctgaa acgagacaca gaagaccaag aagaccaagt 8340 

agatccgcgg ctcattgatg ggaagatgac caggcgggga gacagcccct ggcaggtggg 8400 

aggcgaggca gcaccggctc gtcacgtgct gggtccggga tcactgagtc catcctggca 8460 

gctatgctca gggtgcagaa accgagaggg aagcgctgcc attgcgtttg ggggatgatg 8520 

aaggtggggg atgcttcagg gaaagatgga cgcaacctga ggggagagga gcagccaggg 8580

tgggtgaggg gaggggcatg ggggcatgga ggggtctgca ggagggaggg ttacagtttc 8640

taaaaagagc tggaaagaca ctgctctgct ggcgggattt taggcagaag ccctgctgat 8700

gggagagggc taggagggag ggccgggcct gagtacccct ccagcctcca catgggaact 8760 

gacactcact gggttcccct ctctgccagg catgggggag ataggaacca acaagtggga 8820

gtatttgccc tggggactca gactctgcaa gggtcaggac cccaaagacc cggcagccca 8880
gtgggaccac agccaggacg gcccttcaag ataggggctg agggaggcca aggggaacat 8940
ccaggcagcc tgggggccac aaagtcttcc tggaagacac aaggcctgcc aagcctctaa 9000 
ggatgagagg agctcgctgg gcgatgttgg tgtggctgag ggtgactgaa acagtatgaa 9060 
cagtgcagga acagcatggg caaaggcagg aagacaccct gggacaggct gacactgtaa 9120 
aatgggcaaa aatagaaaac gccagaaagg cctaagccta tgcccatatg accagggaac 9180

ccaggaaagt gcatatgaaa cccaggtgcc ctggactgga ggctgtcagg aggcagccct 9240 
gtgatgtcat catcccaccc cattccaggt ggtcctgctg gactcaaaga agaagctggc 9300 
ctgcggggca gtgctcatcc acccctcctg ggtgctgaca gcggcccact gcatggatga 9360 
gtccaagaag ctccttgtca ggcttggtat gggctggagc caggcagaag ggggctgcca 9420 

gaggcctggg tagggggacc aggcaggctg ttcaggtttg ggggaccccg ctccccaggt 9480
gcttaagcaa gaggcttctt gagctccaca gaaggtgttt ggggggaaga ggcctatgtg 9540 
cccccaccct gcccacccat gtacacccag tattttgcag tagggggttc tctggtgccc 9600 
tcttcgaatc tgggcacagg tacctgcaca cacatgtttg tgaggggcta cacagacctt 9660 
cacctctcca ctcccactca tgaggagcag gctgtgtggg cctcagcacc cttgggtgca 9720 

gagaccagca aggcctggcc tcagggctgt gcctcccaca gactgacagg gatggagctg 9780
tacagaggga gccctagcat ctgccaaagc cacaagctgc ttccctagca ggctgggggc 9840 
tcctatgcat tggccccgat ctatggcaat ttctggaggg ggggtctggc tcaactcttt 9900 
atgccaaaaa gaaggcaaag catattgaga aaggccaaat tcacatttcc tacagcataa 9960 
tctatgccag tggccccgtg gggcttggct tagaattccc aggtgctctt cccagggaac 10020 

catcagtctg gactgagagg accttctctc tcaggtggga cccggccctg tcctccctgg 10080
cagtgccgtg ttctgggggt cctcctctct gggtctcact gcccctgggg tctctccagc 10140
tacctttgct ccatgttcct ttgtggctct ggtctgtgtc tggggtttcc aggggtctcg 10200
ggcttccctg ctgcccattc cttctctggt ctcacggctc cgtgactcct gaaaaccaac 10260 
cagcatccta cccctttgga ttgacacctg ttggccactc cttctggcag gaaaagtcac 10320 

cgttgatagg gttccacggc atagacaggt ggctccgcgc cagtgcctgg gacgtgtggg 10380
tgcacagtct ccgggtgaac cttcttcagg ccctctccca ggcctgcagg ggcacagcag 10440 
tgggtgggcc tcaggaaagt gccactgggg agaggctccc cgcagcccac tctgactgtg 10500 
ccctctgccc tgcaggagag tatgacctgc ggcgctggga gaagtgggag ctggacctgg 10560 
acatcaagga ggtcttcgtc caccccaact acagcaagag caccaccgac aatgacatcg 10620 

cactgctgca cctggcccag cccgccaccc tctcgcagac catagtgccc atctgcctcc 10680
cggacagcgg ccttgcagag cgcgagctca atcaggccgg ccaggagacc ctcgtgacgg 10740 
gctggggcta ccacagcagc cgagagaagg aggccaagag aaaccgcacc ttcgtcctca 10800 
acttcatcaa gattcccgtg gtcccgcaca atgagtgcag cgaggtcatg agcaacatgg 10860 
tgtctgagaa catgctgtgt gcgggcatcc tcggggaccg gcaggatgcc tgcgagggcg 10920 

acagtggggg gcccatggtc gcctccttcc acggcacctg gttcctggtg ggcctggtga 10980
gctggggtga gggctgtggg ctccttcaca actacggcgt ttacaccaaa gtcagccgct 11040 
acctcgactg gatccatggg cacatcagag acaaggaagc cccccagaag agctgggcac 11100 
cttagcgacc ctccctgcag ggctgggctt ttgcatggca atggatggga cattaaaggg 11160 
acatgtaaca agcacaccgg cctgctgttc tgtccttcca tccctctttt gggctcttct 11220 

ggagggaagt aacatttact gagcacctgt tgtatgtcac atgccttatg aatagaatct 11280
taactcctag agcaactctg tggggtgggg aggagcagat ccaagttttg cggggtctaa 11340 
agctgtgtgt gttgaggggg atactctgtt tatgaaaaag aataaaaaac acaaccacga 11400 
agccactaga gccttttcca gggctttggg aagagcctgt gcaagccggg gatgctgaag 11460 
gtgaggcttg accagctttc cagctagccc agctatgagg tagacatgtt tagctcatat 11520 

cacagaggag gaaactgagg ggtctgaaag gtttacatgg tggagccagg attcaaatct 11580
aggtctgact ccaaaaccca ggtgcttttt tctgttctcc actgtcctgg aggacagctg 11640
tttcgacggt gctcagtgtg gaggccacta ttagctctgt agggaagcag ccagagaccc 11700 
agaaagtgtt ggttcagccc agaat 11725 

<210> 51 
<211> 32 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 51 

ttattttata tatataatat atatataaaa ta 32



<210> 52 
<211> 9 
<212> DNA 

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 52 

tataatata 9 



<210> 53 
<211> 16 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 53 

caatataaat atatag 16 

<210> 54 
<211> 44 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 54

tgtgtgtgta tgcgtgtgtg tagacacaca cgcatacaca cata 44



<210> 55 
<211> 41

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 55 

ttattttata tatataatat atatataaaa tatataatat a 41 

<210> 56

<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic 
<400> 56 

tataatatac aatataaata tatag 25 


<210> 57 
<211> 60 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 57 

caatataaat atatagtgtg tgtgtatgcg tgtgtgtaga cacacacgca tacacacata 60

<210> 58 
<211> 101 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 58

ttattttata tatataatat atatataaaa tatataatat acaatataaa tatatagtgt 60
gtgtgtatgc gtgtgtgtag acacacacgc atacacacat a 101

<210> 59
<211> 723 
<212> DNA 

<213> Homo sapiens 
<400> 59

tttaggaaaa ttgttaagac tgggttgtat gcacactggt gtttattata ttatgtagtt 60 
ttttctgtat ttttacaaca tttcagaatt aaaagcaaca gctagaaaaa gagggaaatg 120 
gccgggtgca gtcgtcacgc ctgtaatccc agcactttgg gaggccaagg cgggcggatc 180 
acgaggtcga gagatcgaga ccatcctggc caacatggtg aaaccccatc tctactaaaa 240 

atacaaaaat taactgggca tggtggcatg cgcctgtagt cccaggagaa ttgcttgaac 300
ctgggaggcg gaggttgcag tgagccaaga tctcaccact gctctccagc ctggtgacag 360 
ggcaagactc cgtcaaaaaa aaaaaagaga gggagagcca gagtatgaaa aaggaagtca 420 
gagcccttta atgagtcagc tttgtaggtc tccaggtagg aggctagtgc ttcagtgtct 480 
aggacatagt aggtgttcag taaattaaat tcaggacaaa aagaacatgc cccaaggacc 540 

atctgatatc cacttaaagt gatggactac ctcgtttccc ttgtttatga atgggttcat 600
gcctaagact gtgtgcactt taatacaagg gcagtcgttc agaactagtc aggtcctgaa 660 
aaggatttac caaatgttga gtgtgccctc tagtgttcac acttcccagc tttcttccta 720 
taa 723 

<210> 60

<211> 447 
<212> DNA 

<213> Homo sapiens 
<400> 60

aatatccagt tgagataatg gacttgcctc ttatctaata atacccaggc tcaatgcgtc 60 
actgctttgt ccactttgcc caaaattcaa gcacagctaa gttgatattt taggacaaag 120 
gcagcttact atccagccag aggggagtag aatatggtta agagagagtg gaaagaatga 180 
atgagccctg ctattcctca ctgcctggat ggctataagc acagccctta tggaggcctt 240 
aggtcttgct tcacaatatt ccagtttgaa aagggtttga aaagacctcc tagaaaaatc 300
agtagttttt ctcttttgag taacatgtag caaaaaaaat ttcatcatgt aggtacaggg 360
aacaccctag taactattaa tctcaaggag tcaagccagt gtgtttccta atgtatctgc 420 
tgtatcccca tgaagcaaat tttgcca 447 

<210> 61

<211> 103 
<212> DNA 

<213> Homo sapiens 
<400> 61

aggtggatca aggcacttgc ttacaactgg aactgaaatc ctccaagtcg atctagacat 60 

tgagatggag aaaatattca ttgtcgactg taattatgca acg 103 

<210> 62

<211> 7 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic 
<400> 62 

gtggaag 7 


<210> 63 
<211> 7 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 63 

gtggatg 7

<210> 64 
<211> 7 
<212> DNA 
<213> Artificial Sequence

<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 64

ctggaag 7 

<210> 65 
<211> 7 
<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 

<400> 65 

ctggatg 7

<210> 66 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 66 

acccattcag tcgaggaagg atagggtggt at 32 

<210> 67 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 67 

agccattgag tcgaggaagg atagggtggt at 32 

<210> 68 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 68 

agccattcag acgaggaagg atagggtggt at 32 

<210> 69 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 69

agccattcag tcgaggaagg atagggtggt tt 32



<210> 70 
<211> 32
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 70 

agccattcag tcgaggatcc caagggtggt at 32 

<210> 71 
<211> 32 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 71 

agccattcag tcgaggaagg atagggccta at 32 

<210> 72
<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 72 

agaccattca gtcgaggaag gatagggtgg tat 33 

<210> 73 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 73

agccattcag tcgaggaagg atagcggtgg tat 33



<210> 74 
<211> 25

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic
<400> 74 

agccattcag tcgaggaagg ataat 25 

<210> 75

<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic 
<400> 75

agccattcga ggaaggatag ggtggtat 28 


<210> 76 
<211> 1273 
<212> DNA 

<213> Artificial Sequence


<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 76 

ggcctctgac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60
gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120 
gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 
cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240 
tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300 

cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360
tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420 
tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480
ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540 
ttttaccctc catggtcgtt aaaggagaga tggggagcat cattctgtta tacttctgta 600 

cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660
aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840 

agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900 

agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960

ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020 

ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080 

aagaagttga agttgcctag accagaggac ataagtatca tgtctccttt aactagcata 1140 

ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200 

aagttgtcct tttctggttt cgtgttcacc atggaacatt ttgattatag ttaatccttc 1260

tatcttgaat ctt 1273 

<210> 77 
<211> 1273 
<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 


<400> 77 

ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60 

gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120 

gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 

cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240

tccactatgg caactaactc actcaaattt ccctccttag cagcattcca tcttcccgat 300 

cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360 

tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420 

tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480 

ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540

ttttaccctc catggtcgtt aaaggagaga tggggagcat cattctgtta tacttctgta 600 

cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660 

aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720 

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840

agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900 

agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960 

ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020 

ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080 

aagaagttga agttgcctag accagaggac ataagtatca tgtctccttt aactagcata 1140

ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200 

aagttgtcct tttctggttt cgtgttcacc atggaacatt ttgattatag ttaatccttc 1260

tatcttgaat ctt 1273 
<210> 78
<211> 1273 
<212> DNA 

<213> Artificial Sequence 


<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 78 

ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60

gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120 

gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 

cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240 

tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300 

cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360

tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420 

tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480 

ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540 

ttttaccctc catggtcgtt aaaggagaga tggggacgat cattctgtta tacttctgta 600 

cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660

aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720 

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840

agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900 

agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960

ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020 

ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080 

aagaagttga agttgcctag accagaggac ataagtatca tgtctccttt aactagcata 1140 

ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200 

aagttgtcct tttctggttt cgtgttcacc atggaacatt ttgattatag ttaatccttc 1260

tatcttgaat ctt 1273 

<210> 79
<211> 1273 
<212> DNA

<213> Artificial Sequence 






<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 79 

ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60 

gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120 

gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 

cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240
tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300

cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360 

tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420 

tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480

ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540

ttttaccctc catggtcgtt aaaggagaga tggggagcat cattctgtta tacttctgta 600 

cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660 

aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720 

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840

agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900 

agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960 

ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020 

ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080 

aagaagttga agttgcctag accagaggac ataagtatgt actctccttt aactagcata 1140

ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200

aagttgtcct tttctggttt cgtgttcacc atggaacatt ttgattatag ttaatccttc 1260 
tatcttgaat ctt 1273 

<210> 80

<211> 1275 
<212> DNA 

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Synthetic



<400> 80 

ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60 

gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120
gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 
cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240 
tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300 
ttcttctttg cttctccaac caaaacatca atgtttatta gttctgtata cagtacagga 360 

tctttggtct actctatcac aaggccagta ccacactcat gaagaaagaa cacaggagta 420

gctgagaggc taaaactcat caaaaacact actccttttc ctctacccta ttcctcaatc 480 
ttttaccttt tccaaatccc aatccccaaa tcagtttttc tctttcttac tccctctctc 540 
ccttttaccc tccatggtcg ttaaaggaga gatggggagc atcattctgt tatacttctg 600 
tacacagtta tacatgtcta tcaaacccag acttgcttcc atagtggaga cttgcttttc 660 

agaacatagg gatgaagtaa ggtgcctgaa aagtttgggg gaaaagtttc tttcagagag 720
ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 780 
agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 840 
taagccattc taagagcttg tatggttatg gaggtctgac taggcatgat ttcacgaagg 900 
caagattggc atatcattgt aactaaaaaa gctgacattg acccagacat attgtactct 960 

ttctaaaaat aataataata atgctaacag aaagaagaga accgttcgtt tgcaatctac 1020
agctagtaga gactttgagg aagaattcaa cagtgtgtct tcagcagtgt tcagagccaa 1080

gcaagaagtt gaagttgcct agaccagagg acataagtat catgtctcct ttaactagca 1140 

taccccgaag tggagaaggg tgcagcaggc tcaaaggcat aagtcattcc aatcagccaa 1200 

ctaagttgtc cttttctggt ttcgtgttca ccatggaaca ttttgattat agttaatcct 1260 

tctatcttga atctt 1275 

<210> 81 

<211> 1276 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
<400> 81 

ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60

gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120 

gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 

cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240 

tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300 

cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360 

tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420 

tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480 

ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540 

ttttaccctc catggtcgtt aaaggagaga tggggagcat cattctgtta tacttctgta 600 

cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660 

aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720 

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840 

agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900 

agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960 

ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020 

ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080 

aagaagttga agttgcctag accagaggac ataagtatca tgtctccttt aactagcata 1140 

ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200 

gctaagttgt ccttttctgg tttcgtgttc accatggaac attttgatta tagttaatcc 1260 
ttctatcttg aatctt 1276 

<210> 82 
<211> 1272 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: Synthetic
<400> 82 

ggcctctcac taactaatca ctttcccatc ttttgttaga tttgaatata tacattctat 60

gatcattgct ttttctcttt acaggggaga atttcatatt ttacctgagc aaattgatta 120

gaaaatggaa ccactagagg aatataatgt gttaggaaat tacagtcatt tctaagggcc 180 

cagcccttga caaaattgtg aagttaaatt ctccactctg tccatcagat actatggttc 240 

tccactatgg caactaactc actcaatttt ccctccttag cagcattcca tcttcccgat 300 

cttctttgct tctccaacca aaacatcaat gtttattagt tctgtataca gtacaggatc 360

tttggtctac tctatcacaa ggccagtacc acactcatga agaaagaaca caggagtagc 420 

tgagaggcta aaactcatca aaaacactac tccttttcct ctaccctatt cctcaatctt 480 

ttaccttttc caaatcccaa tccccaaatc agtttttctc tttcttactc cctctctccc 540 

ttttaccctc catggtcgtt aaaggagaga tggggagcat cattctgtta tacttctgta 600 

cacagttata catgtctatc aaacccagac ttgcttccat agtggagact tgcttttcag 660

aacataggga tgaagtaagg tgcctgaaaa gtttggggga aaagtttctt tcagagagtt 720 

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 780 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 840 

agccattcta agagcttgta tggttatgga ggtctgacta ggcatgattt cacgaaggca 900 

agattggcat atcattgtaa ctaaaaaagc tgacattgac ccagacatat tgtactcttt 960

ctaaaaataa taataataat gctaacagaa agaagagaac cgttcgtttg caatctacag 1020 

ctagtagaga ctttgaggaa gaattcaaca gtgtgtcttc agcagtgttc agagccaagc 1080 

aagaagttga agttgcctag accagaggac ataagtatca tgtctccttt aactagcata 1140

ccccgaagtg gagaagggtg cagcaggctc aaaggcataa gtcattccaa tcagccaact 1200 

aagttgtcct tttctggttt cgtgttcacc atggaacatt ttgattatag ttatccttct 1260

atcttgaatc tt 1272 

<210> 83 
<211> 1272 
<212> DNA

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 


<400> 83 

gcctctcact aactaatcac tttcccatct tttgttagat ttgaatatat acattctatg 60 

atcattgctc tttctcttta caggggagaa tttcatattt tacctgagca aattgattag 120 

aaaatggaac cactagagga atataatgtg ttaggaaatt acagtcattt ctaagggccc 180 

agcccttgac aaaattgtga agttaaattc tccactctgt ccatcagata ctatggttct 240

ccactatggc aactaactca ctcaattttc cctccttagc agcattccat cttcccgatc 300 

ttctttgctt ctccaaccaa aacatcaatg tttattagtt ctgtatacag tacaggatct 360 

ttggtctact ctatcacaag gccagtacca cactcatgaa gaaagaacac aggagtagct 420 

gagaggctaa aactcatcaa aaacactact ccttttcctc taccctattc ctcaatcttt 480 

taccttttcc aaatcccaat ccccaaatca gtttttctct ttcttactcc ctctctccct 540
tttaccctcc atggtcgtta aaggagagat ggggagcatc attctgttat actcctgtac 600
acagttatac atgtctatca aacccagact tgcttccata gtggagactt gcttttcaga 660
acatagggat gaagtaaggt gcctgaaaag tttgggggaa aagtttcttt cagagagtta 720 
agttatttta tatatataat atatatataa aatatataat atacaatata aatatatagt 780 
gtgtgtgtgt atgcgtgtgt gtagacacac acgcatacac acatataatg gaagcaataa 840 
gccattctaa gagcttgtat ggttatggag gtctgactag gcatgatttc acgaaggcaa 900 
gattggcata tcattgtaac taaaaaagct gacattgacc cagacatatt gtactctttc 960 
taaaaataat aataataatg ctaacagaaa gaagagaacc gttcgtttgc aatctacagc 1020 
tagtagagac tttgaggaag aattcaacag tgtgtcttca gcagtgttca gagccaagca 1080 
agaagttgaa gttgcctaga ccagaggaca taagtatcat gtctccttta actagcatac 1140 
cccgaagtgg agaagggtgc agcaggctca aaggcataag tcattccaat cagccaacta 1200 
agttgtcctt ttctggtttc gtgttcacca tggaacattt tgattatagt taatccttct 1260 
atcttgaatc tt 1272 



<210> 84 
<211> 7 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 



<400> 84 
gcggaag 7

<210> 85 
<211> 1463 
<212> DNA 

<213> Homo sapiens 



<400> 85 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgatagac 660 

tgcaggacac aggcaaaggt gctgagctgg accctttatt tctgcccttc tcccttctgg 720 

caccccggcc aggaaattgc tgcagccttt ctggaatccc gttcattttt cttactggtc 780 

cacaaaaggg gccaaatgga agcagcaaga cctgagttca aattaaatct gccaactacc 840 
agctcagtga atctgggcga gtaacacaaa acttgagtgt ccttacctga aaaatagagg 900

ttagagggat gctatgtgcc attgtgtgtg tgtgttgggg gtggggattg ggggtgattt 960 

gtgagcaatt ggaggtgagg gtggagccca gtgcccagca cctatgcact ggggacccaa 1020 

aaaggagcat cttctcatga ttttatgtat cagaaattgg gatggcatgt cattgggaca 1080 

gcgtcttttt tcttgtatgg tggcacataa atacatgtgt cttataatta atggtatctt 1140

agatttgacg aaatatggaa tattacctgt tgtgctgatc ttgggcaaac tataatatct 1200 

ctgggcaaaa atgtccccat ctgaaaaaca gggacaacgt tcctccctca gccagccact 1260 

atggggctaa aatgagacca catctgtcaa gggttttgcc ctcacctccc tccctgctgg 1320

atggcatcct tggtaggcag aggtgggctt cgggcagaac aagccgtgct gagctaggac 1380 

caggagtgct agtgccactg tttgtctatg gagagggagg cctcagtgct gagggccaag 1440

caaatatttg tggttatgga tta 1463 



<210> 86 

<211> 83 

<212> DNA

<213> Homo sapiens



<400> 86 

caggagtgct agtgccactg tttgtctatg gagagggagg cctcagtgct gagggccaag 60 
caaatatttg tggttatgga tta 83

<210> 87 
<211> 614 
<212> DNA 
<213> Homo sapiens



<400> 87

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggt 614

<210> 88
<211> 661
<212> DNA
<213> Homo sapiens

<400> 88

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgatagac 660
t 661



<210> 89 
<211> 7 
<212> DNA 
<213> Homo sapiens
<400> 89

gaggaaa 7



<210> 90 
<211> 7 
<212> DNA 

<213> Homo sapiens 
<400> 90 

caggaag 7 



<210> 91 

<211> 102 

<212> DNA 

<213> Homo sapiens 



<400> 91 

tattttatat atataatata tatataaaat atataatata caatataaat atatagtgtg 60 
tgtgtgtatg cgtgtgtgta gacacacacg catacacaca ta 102 



<210> 92 

<211> 1380 

<212> DNA 

<213> Homo sapiens 
<400> 92

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgatagac 660 
tgcaggacac aggcaaaggt gctgagctgg accctttatt tctgcccttc tcccttctgg 720 
caccccggcc aggaaattgc tgcagccttt ctggaatccc gttcattttt cttactggtc 780 

cacaaaaggg gccaaatgga agcagcaaga cctgagttca aattaaatct gccaactacc 840
agctcagtga atctgggcga gtaacacaaa acttgagtgt ccttacctga aaaatagagg 900 
ttagagggat gctatgtgcc attgtgtgtg tgtgttgggg gtggggattg ggggtgattt 960 
gtgagcaatt ggaggtgagg gtggagccca gtgcccagca cctatgcact ggggacccaa 1020 
aaaggagcat cttctcatga ttttatgtat cagaaattgg gatggcatgt cattgggaca 1080 

gcgtcttttt tcttgtatgg tggcacataa atacatgtgt cttataatta atggtatttt 1140
agatttgacg aaatatggaa tattacctgt tgtgctgatc ttgggcaaac tataatatct 1200
ctgggcaaaa atgtccccat ctgaaaaaca gggacaacgt tcctccctca gccagccact 1260 
atggggctaa aatgagacca catctgtcaa gggttttgcc ctcacctccc tccctgctgg 1320 
atggcatcct tggtaggcag aggtgggctt cgggcagaac aagccgtgct gagctaggac 1380 






<210> 93 
<211> 154 
<212> DNA 

<213> Homo sapiens 
<400> 93 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagcaataag ccat 154 

<210> 94 
<211> 153 
<212> DNA 

<213> Homo sapiens 
<400> 94 

tgggggaaaa gtttctttca gagagttaag ttattttata tatataatat atatataaaa 60 
tatataatat acaatataaa tatatagtgt gtgtgtgtat gcgtgtgtgt agacacacac 120
gcatacacac atataatgga agcaataagc cat 153 
<210> 95

<211> 152 

<212> DNA 

<213> Homo sapiens 

<400> 95 

gggggaaaag tttctttcag agagttaagt tattttatat atataatata tatataaaat 60 

atataatata caatataaat atatagtgtg tgtgtgtatg cgtgtgtgta gacacacacg 120 
catacacaca tataatggaa gcaataagcc at 152 

<210> 96 

<211> 151 

<212> DNA 

<213> Homo sapiens 

<400> 96 

ggggaaaagt ttctttcaga gagttaagtt attttatata tataatatat atataaaata 60 

tataatatac aatataaata tatagtgtgt gtgtgtatgc gtgtgtgtag acacacacgc 120

atacacacat ataatggaag caataagcca t 151 

<210> 97 

<211> 150 

<212> DNA 

<213> Homo sapiens 

<400> 97 

gggaaaagtt tctttcagag agttaagtta ttttatatat ataatatata tataaaatat 60

ataatataca atataaatat atagtgtgtg tgtgtatgcg tgtgtgtaga cacacacgca 120
tacacacata taatggaagc aataagccat 150 

<210> 98 

<211> 149

<212> DNA 

<213> Homo sapiens 

<400> 98 

ggaaaagttt ctttcagaga gttaagttat tttatatata taatatatat ataaaatata 60 

taatatacaa tataaatata tagtgtgtgt gtgtatgcgt gtgtgtagac acacacgcat 120 
acacacatat aatggaagca ataagccat 149 

<210> 99 

<211> 148 

<212> DNA 

<213> Homo sapiens 
<400> 99

gaaaagtttc tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat 60
aatatacaat ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata 120 
cacacatata atggaagcaa taagccat 148 

<210> 100 
<211> 147 
<212> DNA 

<213> Homo sapiens 
<400> 100

aaaagtttct ttcagagagt taagttattt tatatatata atatatatat aaaatatata 60 
atatacaata taaatatata gtgtgtgtgt gtatgcgtgt gtgtagacac acacgcatac 120 
acacatataa tggaagcaat aagccat 147 

<210> 101 
<211> 146 
<212> DNA

<213> Homo sapiens 
<400> 101 

aaagtttctt tcagagagtt aagttatttt atatatataa tatatatata aaatatataa 60 
tatacaatat aaatatatag tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca 120 
cacatataat ggaagcaata agccat 146 

<210> 102 
<211> 145 
<212> DNA 

<213> Homo sapiens 
<400> 102 

aagtttcttt cagagagtta agttatttta tatatataat atatatataa aatatataat 60 
atacaatata aatatatagt gtgtgtgtgt atgcgtgtgt gtagacacac acgcatacac 120 
acatataatg gaagcaataa gccat 145 

<210> 103 
<211> 144 
<212> DNA 

<213> Homo sapiens 
<400> 103 

agtttctttc agagagttaa gttattttat atatataata tatatataaa atatataata 60 
tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca cgcatacaca 120 
catataatgg aagcaataag ccat 144



<210> 104
<211> 143 
<212> DNA 
<213> Homo sapiens 

<400> 104 

gtttctttca gagagttaag ttattttata tatataatat atatataaaa tatataatat 60

acaatataaa tatatagtgt gtgtgtgtat gcgtgtgtgt agacacacac gcatacacac 120 
atataatgga agcaataagc cat 143 

<210> 105 
<211> 142 
<212> DNA 
<213> Homo sapiens 

<400> 105 

tttctttcag agagttaagt tattttatat atataatata tatataaaat atataatata 60 

caatataaat atatagtgtg tgtgtgtatg cgtgtgtgta gacacacacg catacacaca 120 

tataatggaa gcaataagcc at 142 

<210> 106 
<211> 141 
<212> DNA 
<213> Homo sapiens 

<400> 106 

ttctttcaga gagttaagtt attttatata tataatatat atataaaata tataatatac 60 
aatataaata tatagtgtgt gtgtgtatgc gtgtgtgtag acacacacgc atacacacat 120 
ataatggaag caataagcca t 141 

<210> 107 
<211> 140 
<212> DNA 
<213> Homo sapiens 

<400> 107 

tctttcagag agttaagtta ttttatatat ataatatata tataaaatat ataatataca 60 
atataaatat atagtgtgtg tgtgtatgcg tgtgtgtaga cacacacgca tacacacata 120 
taatggaagc aataagccat 140 

<210> 108 
<211> 139 
<212> DNA 
<213> Homo sapiens 
<400> 108

ctttcagaga gttaagttat tttatatata taatatatat ataaaatata taatatacaa 60 

tataaatata tagtgtgtgt gtgtatgcgt gtgtgtagac acacacgcat acacacatat 120 
aatggaagca ataagccat 139 

<210> 109 
<211> 138 
<212> DNA 

<213> Homo sapiens 
<400> 109 

tttcagagag ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat 60 
ataaatatat agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata 120 
atggaagcaa taagccat 138 

<210> 110 
<211> 137 
<212> DNA 

<213> Homo sapiens 
<400> 110 

ttcagagagt taagttattt tatatatata atatatatat aaaatatata atatacaata 60 
taaatatata gtgtgtgtgt gtatgcgtgt gtgtagacac acacgcatac acacatataa 120 
tggaagcaat aagccat 137 

<210> 111 
<211> 136 
<212> DNA

<213> Homo sapiens 
<400> 111 

tcagagagtt aagttatttt atatatataa tatatatata aaatatataa tatacaatat 60 
aaatatatag tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat 120 
ggaagcaata agccat 136 

<210> 112 
<211> 135 
<212> DNA 

<213> Homo sapiens 

<400> 112

cagagagtta agttatttta tatatataat atatatataa aatatataat atacaatata 60 
aatatatagt gtgtgtgtgt atgcgtgtgt gtagacacac acgcatacac acatataatg 120 
gaagcaataa gccat 135 
<210> 113
<211> 134 
<212> DNA 

<213> Homo sapiens 
<400> 113 

agagagttaa gttattttat atatataata tatatataaa atatataata tacaatataa 60 
atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca cgcatacaca catataatgg 120 
aagcaataag ccat 134 

<210> 114 
<211> 133 
<212> DNA 

<213> Homo sapiens 
<400> 114 

gagagttaag ttattttata tatataatat atatataaaa tatataatat acaatataaa 60 
tatatagtgt gtgtgtgtat gcgtgtgtgt agacacacac gcatacacac atataatgga 120
agcaataagc cat 133 

<210> 115 
<211> 132 
<212> DNA 

<213> Homo sapiens 
<400> 115 

agagttaagt tattttatat atataatata tatataaaat atataatata caatataaat 60 
atatagtgtg tgtgtgtatg cgtgtgtgta gacacacacg catacacaca tataatggaa 120 
gcaataagcc at 132 

<210> 116 
<211> 131 
<212> DNA 

<213> Homo sapiens 
<400> 116 

gagttaagtt attttatata tataatatat atataaaata tataatatac aatataaata 60 
tatagtgtgt gtgtgtatgc gtgtgtgtag acacacacgc atacacacat ataatggaag 120 
caataagcca t 131 

<210> 117 

<211> 130 

<212> DNA 

<213> Homo sapiens 
<400> 117

agttaagtta ttttatatat ataatatata tataaaatat ataatataca atataaatat 60 

atagtgtgtg tgtgtatgcg tgtgtgtaga cacacacgca tacacacata taatggaagc 120 
aataagccat 130 

<210> 118 
<211> 129 
<212> DNA 
<213> Homo sapiens 

<400> 118

gttaagttat tttatatata taatatatat ataaaatata taatatacaa tataaatata 60 

tagtgtgtgt gtgtatgcgt gtgtgtagac acacacgcat acacacatat aatggaagca 120 

ataagccat 129 

<210> 119 
<211> 128 
<212> DNA 
<213> Homo sapiens 

<400> 119 

ttaagttatt ttatatatat aatatatata taaaatatat aatatacaat ataaatatat 60 

agtgtgtgtg tgtatgcgtg tgtgtagaca cacacgcata cacacatata atggaagcaa 120 

taagccat 128 

<210> 120 
<211> 127 
<212> DNA 
<213> Homo sapiens 

<400> 120 

taagttattt tatatatata atatatatat aaaatatata atatacaata taaatatata 60 

gtgtgtgtgt gtatgcgtgt gtgtagacac acacgcatac acacatataa tggaagcaat 120 
aagccat 127 

<210> 121 
<211> 126 
<212> DNA 
<213> Homo sapiens 

<400> 121 

aagttatttt atatatataa tatatatata aaatatataa tatacaatat aaatatatag 60 

tgtgtgtgtg tatgcgtgtg tgtagacaca cacgcataca cacatataat ggaagcaata 120 
agccat 126 
<210> 122
<211> 125 
<212> DNA 

<213> Homo sapiens 
<400> 122 

agttatttta tatatataat atatatataa aatatataat atacaatata aatatatagt 60 
gtgtgtgtgt atgcgtgtgt gtagacacac acgcatacac acatataatg gaagcaataa 120
gccat 125 

<210> 123 
<211> 124 
<212> DNA 

<213> Homo sapiens 
<400> 123 

gttattttat atatataata tatatataaa atatataata tacaatataa atatatagtg 60 
tgtgtgtgta tgcgtgtgtg tagacacaca cgcatacaca catataatgg aagcaataag 120 
ccat 124 

<210> 124 
<211> 123 
<212> DNA 

<213> Homo sapiens 
<400> 124 

ttattttata tatataatat atatataaaa tatataatat acaatataaa tatatagtgt 60 
gtgtgtgtat gcgtgtgtgt agacacacac gcatacacac atataatgga agcaataagc 120 
cat 123 

<210> 125 
<211> 153 
<212> DNA 
<213> Homo sapiens 

<400> 125 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 

cgcatacaca catataatgg aagcaataag cca 153 

<210> 126
<211> 152 
<212> DNA 
<213> Homo sapiens 
<400> 126

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagcaataag cc 152 

<210> 127 
<211> 151 
<212> DNA 

<213> Homo sapiens 
<400> 127 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagcaataag c 151 

<210> 128 
<211> 150 
<212> DNA 
<213> Homo sapiens 

<400> 128 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagcaataag 150 

<210> 129 
<211> 149 
<212> DNA 
<213> Homo sapiens 

<400> 129 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagcaataa 149 

<210> 130 
<211> 148 
<212> DNA 

<213> Homo sapiens 
<400> 130 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120
cgcatacaca catataatgg aagcaata 148 
<210> 131
<211> 147 
<212> DNA 

<213> Homo sapiens
<400> 131 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120
cgcatacaca catataatgg aagcaat 147 

<210> 132 
<211> 146 
<212> DNA 
<213> Homo sapiens 

<400> 132 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagcaa 146 

<210> 133 
<211> 145 
<212> DNA 
<213> Homo sapiens 

<400> 133 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagca 145 

<210> 134 
<211> 144 
<212> DNA 
<213> Homo sapiens 

<400> 134 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aagc 144 

<210> 135
<211> 143 
<212> DNA 
<213> Homo sapiens 
<400> 135

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 

cgcatacaca catataatgg aag 143 

<210> 136 
<211> 142 
<212> DNA 

<213> Homo sapiens 
<400> 136 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg aa 142 

<210> 137 
<211> 141 
<212> DNA 

<213> Homo sapiens 
<400> 137 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg a 141 

<210> 138 
<211> 140 
<212> DNA 

<213> Homo sapiens 
<400> 138 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatgg 140 

<210> 139 
<211> 139 
<212> DNA 

<213> Homo sapiens 
<400> 139 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataatg 139 
<210> 140
<211> 138 
<212> DNA 
<213> Homo sapiens 

<400> 140 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataat 138 

<210> 141 
<211> 137 
<212> DNA 
<213> Homo sapiens 

<400> 141 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catataa 137 

<210> 142 
<211> 136 
<212> DNA 
<213> Homo sapiens 

<400> 142 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catata 136 

<210> 143 
<211> 135 
<212> DNA 
<213> Homo sapiens 

<400> 143 

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 

atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca catat 135 

<210> 144 
<211> 134 
<212> DNA 
<213> Homo sapiens 
<400> 144

ttgggggaaa agtttctttc agagagttaa gttattttat atatataata tatatataaa 60 
atatataata tacaatataa atatatagtg tgtgtgtgta tgcgtgtgtg tagacacaca 120 
cgcatacaca cata 134 

<210> 145 
<211> 659
<212> DNA 
<213> Homo sapiens 



<400> 145 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgataga 659



<210> 146 
<211> 658 
<212> DNA 
<213> Homo sapiens 



<400> 146 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgatag 658



<210> 147 
<211> 657 
<212> DNA 
<213> Homo sapiens 
<400> 147

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgata 657



<210> 148 
<211> 656 
<212> DNA 

<213> Homo sapiens 
<400> 148 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgat 656

<210> 149 
<211> 655 
<212> DNA 

<213> Homo sapiens 



<400> 149 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctga 655

<210> 150
<211> 654
<212> DNA
<213> Homo sapiens

<400> 150

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctg 654

<210> 151
<211> 653
<212> DNA
<213> Homo sapiens

<400> 151

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gct 653

<210> 152
<211> 652
<212> DNA
<213> Homo sapiens

<400> 152

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gc 652

<210> 153
<211> 651
<212> DNA
<213> Homo sapiens

<400> 153

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat g 651

<210> 154
<211> 650
<212> DNA
<213> Homo sapiens

<400> 154

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat 650

<210> 155

<211> 649 
<212> DNA 

<213> Homo sapiens 
<400> 155

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600

ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaa 649



<210> 156 
<211> 648 
<212> DNA

<213> Homo sapiens 

<400> 156 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataa 648 






<210> 157 
<211> 647 
<212> DNA 

<213> Homo sapiens 
<400> 157

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaata 647 



<210> 158 
<211> 646

<212> DNA 

<213> Homo sapiens 
<400> 158 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaat 646

<210> 159
<211> 645
<212> DNA
<213> Homo sapiens

<400> 159

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaa 645

<210> 160
<211> 644
<212> DNA
<213> Homo sapiens

<400> 160

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg ctta 644

<210> 161
<211> 643
<212> DNA
<213> Homo sapiens

<400> 161

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg ctt 643

<210> 162
<211> 642
<212> DNA
<213> Homo sapiens

<400> 162

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg ct 642

<210> 163
<211> 641
<212> DNA
<213> Homo sapiens

<400> 163

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg c 641

<210> 164
<211> 640
<212> DNA
<213> Homo sapiens

<400> 164

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaagacg 640

<210> 165
<211> 639 
<212> DNA 

<213> Homo sapiens 
<400> 165

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600

ggggcatctc tggtgcagcc cggttcggag caggaagac 639 

<210> 166 
<211> 638 
<212> DNA

<213> Homo sapiens 

<400> 166 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggcatctc tggtgcagcc cggttcggag caggaaga 638 






<210> 167
<211> 637
<212> DNA
<213> Homo sapiens

<400> 167

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tggtgcagcc cggttcggag caggaag 637

<210> 168
<211> 650
<212> DNA
<213> Homo sapiens

<400> 168

agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta gagtgttaga 60

gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg gtgtggtgcc 120 

tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt aaagaaactt 180 

caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag ccaggatgtg 240 

ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc tctaggtgtc 300 

cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt cctctgcttt 360

aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt acttatggtg 420

tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga cttccctgtg 480

caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc agacttggag 540 

aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg ggggcatctc 600 

tggtgcagcc cggttcggag caggaagacg cttaataaat gctgatagac 650

<210> 169 
<211> 640 
<212> DNA 
<213> Homo sapiens

<400> 169 

atgtgtacct gcccctgggc aaggtgggcc tgacttgtta gagtgttaga gttttaccct 60 
gttcctctag gagggcctgg taccaccaca gcccagcatg gtgtggtgcc tcagcaggag 120 

gcatctggtt acaatcaaca caagctgttc cagccaattt aaagaaactt caggaggaat 180
agggttttag gagggcatgg ggaccctcct gcacccgaag ccaggatgtg ccaccaatca 240

taaggaggca ggggcctcct tccgctgctc cctgggactc tctaggtgtc cgtggcctca 300 

gcccccctct gcacacctgc atcttccttc tcatcagctt cctctgcttt aagcgtaaac 360 

atggatgccc aggacctggc ctcaatcttc cgagtctggt acttatggtg tactgacagt 420

gtgagaccct actcctctga tcaatcccct gggttggtga cttccctgtg caatcaatgg 480
aagccagcga ggcagggtca catgccccgt ttagaggtgc agacttggag aaggaacgtg 540

ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg ggggcatctc tggtgcagcc 600 
cggttcggag caggaagacg cttaataaat gctgatagac 640

<210> 170

<211> 610 
<212> DNA 

<213> Homo sapiens 
<400> 170

tgacttgtta gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca 60 

gcccagcatg gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc 120 

cagccaattt aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct 180 

gcacccgaag ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc 240 

cctgggactc tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc 300

tcatcagctt cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc 360 

cgagtctggt acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct 420 

gggttggtga cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt 480 

ttagaggtgc agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag 540 

ggaggaaagg ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat 600
gctgatagac 610 

<210> 171 
<211> 300 
<212> DNA

<213> Homo sapiens 



<400> 171 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 60 
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 120

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 180 
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 240 
ggggcatctc tggtgcagcc cggttcggag caggaagacg cttaataaat gctgatagac 300 

<210> 172

<211> 100 

<212> DNA 

<213> Homo sapiens 



<400> 172

ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg ggggcatctc tggtgcagcc 60 
cggttcggag caggaagacg cttaataaat gctgatagac 100

<210> 173

<211> 71

<212> DNA 

<213> Homo sapiens 


<400> 173 

gggaggaaag gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa 60 
tgctgataga c 71 

<210> 174

<211> 613 
<212> DNA 
<213> Homo sapiens 

<400> 174

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc tgg 613 

<210> 175 
<211> 612 
<212> DNA

<213> Homo sapiens 

<400> 175 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 
ggggcatctc tg 612 

<210> 176
<211> 611 
<212> DNA 
<213> Homo sapiens 


<400> 176 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc t 611

<210> 177
<211> 610
<212> DNA
<213> Homo sapiens

<400> 177

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatctc 610

<210> 178
<211> 609
<212> DNA
<213> Homo sapiens

<400> 178

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatct 609

<210> 179
<211> 608
<212> DNA
<213> Homo sapiens

<400> 179

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcatc 608

<210> 180
<211> 607
<212> DNA
<213> Homo sapiens

<400> 180

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
ggggcat 607

<210> 181
<211> 606 
<212> DNA 
<213> Homo sapiens 


<400> 181 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 
ggggca 606 

<210> 182 
<211> 605

<212> DNA 

<213> Homo sapiens 
<400> 182 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

ggggc 605

<210> 183 
<211> 604 
<212> DNA 
<213> Homo sapiens

<400> 183 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300 
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600 

gggg 604 

<210> 184

<211> 603 

<212> DNA 

<213> Homo sapiens 
<400> 184

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60 

gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120 

gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180 

aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240 

ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360 

cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420 

acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480 

cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540 

agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600

ggg 603 

<210> 185 
<211> 602 
<212> DNA

<213> Homo sapiens

<400> 185

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
gg 602

<210> 186
<211> 601 
<212> DNA 
<213> Homo sapiens 


<400> 186 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600
g 601

<210> 187 
<211> 600

<212> DNA 
<213> Homo sapiens 

<400> 187 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaagg 600



<210> 188 
<211> 599 
<212> DNA 
<213> Homo sapiens 



<400> 188 

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaag 599



<210> 189 
<211> 598 
<212> DNA

<213> Homo sapiens

<400> 189

gaattctgta agcatttcct atgtgtacct gcccctgggc aaggtgggcc tgacttgtta 60
gagtgttaga gttttaccct gttcctctag gagggcctgg taccaccaca gcccagcatg 120
gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca caagctgttc cagccaattt 180
aaagaaactt caggaggaat agggttttag gagggcatgg ggaccctcct gcacccgaag 240
ccaggatgtg ccaccaatca taaggaggca ggggcctcct tccgctgctc cctgggactc 300
tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 360
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 420
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 480
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 540
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaa 598

<210> 190
<211> 498
<212> DNA
<213> Homo sapiens

<400> 190

taccaccaca gcccagcatg gtgtggtgcc tcagcaggag gcatctggtt acaatcaaca 60
caagctgttc cagccaattt aaagaaactt caggaggaat agggttttag gagggcatgg 120
ggaccctcct gcacccgaag ccaggatgtg ccaccaatca taaggaggca ggggcctcct 180
tccgctgctc cctgggactc tctaggtgtc cgtggcctca gcccccctct gcacacctgc 240
atcttccttc tcatcagctt cctctgcttt aagcgtaaac atggatgccc aggacctggc 300
ctcaatcttc cgagtctggt acttatggtg tactgacagt gtgagaccct actcctctga 360
tcaatcccct gggttggtga cttccctgtg caatcaatgg aagccagcga ggcagggtca 420
catgccccgt ttagaggtgc agacttggag aaggaacgtg ggcaagtctt cccaggaaca 480
ggtagggcag ggaggaaa 498

<210> 191
<211> 398
<212> DNA
<213> Homo sapiens

<400> 191

agggttttag gagggcatgg ggaccctcct gcacccgaag ccaggatgtg ccaccaatca 60
taaggaggca ggggcctcct tccgctgctc cctgggactc tctaggtgtc cgtggcctca 120
gcccccctct gcacacctgc atcttccttc tcatcagctt cctctgcttt aagcgtaaac 180
atggatgccc aggacctggc ctcaatcttc cgagtctggt acttatggtg tactgacagt 240
gtgagaccct actcctctga tcaatcccct gggttggtga cttccctgtg caatcaatgg 300
aagccagcga ggcagggtca catgccccgt ttagaggtgc agacttggag aaggaacgtg 360
ggcaagtctt cccaggaaca ggtagggcag ggaggaaa 398

<210> 192 
<211> 298 
<212> DNA 

<213> Homo sapiens 



<400> 192 

tctaggtgtc cgtggcctca gcccccctct gcacacctgc atcttccttc tcatcagctt 60
cctctgcttt aagcgtaaac atggatgccc aggacctggc ctcaatcttc cgagtctggt 120
acttatggtg tactgacagt gtgagaccct actcctctga tcaatcccct gggttggtga 180
cttccctgtg caatcaatgg aagccagcga ggcagggtca catgccccgt ttagaggtgc 240
agacttggag aaggaacgtg ggcaagtctt cccaggaaca ggtagggcag ggaggaaa 298



<210> 193 
<211> 198 
<212> DNA

<213> Homo sapiens 



<400> 193 

ctcaatcttc cgagtctggt acttatggtg tactgacagt gtgagaccct actcctctga 60 

tcaatcccct gggttggtga cttccctgtg caatcaatgg aagccagcga ggcagggtca 120 

catgccccgt ttagaggtgc agacttggag aaggaacgtg ggcaagtctt cccaggaaca 180 
ggtagggcag ggaggaaa 198 

<210> 194 
<211> 98 
<212> DNA 

<213> Homo sapiens 
<400> 194 

aagccagcga ggcagggtca catgccccgt ttagaggtgc agacttggag aaggaacgtg 60 
ggcaagtctt cccaggaaca ggtagggcag ggaggaaa 98 



<210> 195 

<211> 8 

<212> DNA 

<213> Homo sapiens
<400> 195
ggaggaaa 8

<210> 196 
<211> 781 
<212> DNA 
<213> Homo sapiens 



<400> 196 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60
ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120
gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180
ccacaaaagg ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac 240
cagctcagtg aatctgggcg agtaacacaa aacttgagtg tccttacctg aaaaatagag 300
gttagaggga tgctatgtgc cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt 360
tgtgagcaat tggaggtgag ggtggagccc agtgcccagc acctatgcac tggggaccca 420
aaaaggagca tcttctcatg attttatgta tcagaaattg ggatggcatg tcattgggac 480
agcgtctttt ttcttgtatg gtggcacata aatacatgtg tcttataatt aatggtattt 540
tagatttgac gaaatatgga atattacctg ttgtgctgat cttgggcaaa ctataatatc 600
tctgggcaaa aatgtcccca tctgaaaaac agggacaacg ttcctccctc agccagccac 660
tatggggcta aaatgagacc acatctgtca agggttttgc cctcacctcc ctccctgctg 720
gatggcatcc ttggtaggca gaggtgggct tcgggcagaa caagccgtgc tgagctagga 780
c 781



<210> 197 
<211> 771 
<212> DNA 
<213> Homo sapiens 

<400> 197 

ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga ctgcaggaca 60 

caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg gcaccccggc 120 

caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt ccacaaaagg 180 

ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac cagctcagtg 240 

aatctgggcg agtaacacaa aacttgagtg tccttacctg aaaaatagag gttagaggga 300 

tgctatgtgc cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt tgtgagcaat 360 

tggaggtgag ggtggagccc agtgcccagc acctatgcac tggggaccca aaaaggagca 420 

tcttctcatg attttatgta tcagaaattg ggatggcatg tcattgggac agcgtctttt 480 

ttcttgtatg gtggcacata aatacatgtg tcttataatt aatggtattt tagatttgac 540 

gaaatatgga atattacctg ttgtgctgat cttgggcaaa ctataatatc tctgggcaaa 600 

aatgtcccca tctgaaaaac agggacaacg ttcctccctc agccagccac tatggggcta 660 

aaatgagacc acatctgtca agggttttgc cctcacctcc ctccctgctg gatggcatcc 720 

ttggtaggca gaggtgggct tcgggcagaa caagccgtgc tgagctagga c 771

<210> 198
<211> 761
<212> DNA
<213> Homo sapiens

<400> 198 

ccggttcgga gcaggaagac gcttaataaa tgctgataga ctgcaggaca caggcaaagg 60
tgctgagctg gaccctttat ttctgccctt ctcccttctg gcaccccggc caggaaattg 120
ctgcagcctt tctggaatcc cgttcatttt tcttactggt ccacaaaagg ggccaaatgg 180
aagcagcaag acctgagttc aaattaaatc tgccaactac cagctcagtg aatctgggcg 240
agtaacacaa aacttgagtg tccttacctg aaaaatagag gttagaggga tgctatgtgc 300
cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt tgtgagcaat tggaggtgag 360
ggtggagccc agtgcccagc acctatgcac tggggaccca aaaaggagca tcttctcatg 420
attttatgta tcagaaattg ggatggcatg tcattgggac agcgtctttt ttcttgtatg 480
gtggcacata aatacatgtg tcttataatt aatggtattt tagatttgac gaaatatgga 540
atattacctg ttgtgctgat cttgggcaaa ctataatatc tctgggcaaa aatgtcccca 600
tctgaaaaac agggacaacg ttcctccctc agccagccac tatggggcta aaatgagacc 660
acatctgtca agggttttgc cctcacctcc ctccctgctg gatggcatcc ttggtaggca 720
gaggtgggct tcgggcagaa caagccgtgc tgagctagga c 761



<210> 199 
<211> 751 
<212> DNA 

<213> Homo sapiens 



<400> 199 

gcaggaagac gcttaataaa tgctgataga ctgcaggaca caggcaaagg tgctgagctg 60 

gaccctttat ttctgccctt ctcccttctg gcaccccggc caggaaattg ctgcagcctt 120 

tctggaatcc cgttcatttt tcttactggt ccacaaaagg ggccaaatgg aagcagcaag 180 

acctgagttc aaattaaatc tgccaactac cagctcagtg aatctgggcg agtaacacaa 240

aacttgagtg tccttacctg aaaaatagag gttagaggga tgctatgtgc cattgtgtgt 300 

gtgtgttggg ggtggggatt gggggtgatt tgtgagcaat tggaggtgag ggtggagccc 360 

agtgcccagc acctatgcac tggggaccca aaaaggagca tcttctcatg attttatgta 420 

tcagaaattg ggatggcatg tcattgggac agcgtctttt ttcttgtatg gtggcacata 480 

aatacatgtg tcttataatt aatggtattt tagatttgac gaaatatgga atattacctg 540

ttgtgctgat cttgggcaaa ctataatatc tctgggcaaa aatgtcccca tctgaaaaac 600 

agggacaacg ttcctccctc agccagccac tatggggcta aaatgagacc acatctgtca 660 

agggttttgc cctcacctcc ctccctgctg gatggcatcc ttggtaggca gaggtgggct 720 

tcgggcagaa caagccgtgc tgagctagga c 751 



<210> 200 
<211> 750 
<212> DNA 

<213> Homo sapiens 



<400> 200

caggaagacg cttaataaat gctgatagac tgcaggacac aggcaaaggt gctgagctgg 60 
accctttatt tctgcccttc tcccttctgg caccccggcc aggaaattgc tgcagccttt 120 
ctggaatccc gttcattttt cttactggtc cacaaaaggg gccaaatgga agcagcaaga 180 
cctgagttca aattaaatct gccaactacc agctcagtga atctgggcga gtaacacaaa 240 
acttgagtgt ccttacctga aaaatagagg ttagagggat gctatgtgcc attgtgtgtg 300 
tgtgttgggg gtggggattg ggggtgattt gtgagcaatt ggaggtgagg gtggagccca 360 
gtgcccagca cctatgcact ggggacccaa aaaggagcat cttctcatga ttttatgtat 420 
cagaaattgg gatggcatgt cattgggaca gcgtcttttt tcttgtatgg tggcacataa 480 
atacatgtgt cttataatta atggtatttt agatttgacg aaatatggaa tattacctgt 540 
tgtgctgatc ttgggcaaac tataatatct ctgggcaaaa atgtccccat ctgaaaaaca 600 
gggacaacgt tcctccctca gccagccact atggggctaa aatgagacca catctgtcaa 660 
gggttttgcc ctcacctccc tccctgctgg atggcatcct tggtaggcag aggtgggctt 720
cgggcagaac aagccgtgct gagctaggac 750 



<210> 201 
<211> 681 
<212> DNA 

<213> Homo sapiens 
<400> 201 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60
ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120
gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180
ccacaaaagg ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac 240
cagctcagtg aatctgggcg agtaacacaa aacttgagtg tccttacctg aaaaatagag 300
gttagaggga tgctatgtgc cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt 360
tgtgagcaat tggaggtgag ggtggagccc agtgcccagc acctatgcac tggggaccca 420
aaaaggagca tcttctcatg attttatgta tcagaaattg ggatggcatg tcattgggac 480
agcgtctttt ttcttgtatg gtggcacata aatacatgtg tcttataatt aatggtattt 540
tagatttgac gaaatatgga atattacctg ttgtgctgat cttgggcaaa ctataatatc 600
tctgggcaaa aatgtcccca tctgaaaaac agggacaacg ttcctccctc agccagccac 660
tatggggcta aaatgagacc a 681



<210> 202 
<211> 581 
<212> DNA 

<213> Homo sapiens 
<400> 202 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60
ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120
gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180
ccacaaaagg ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac 240
cagctcagtg aatctgggcg agtaacacaa aacttgagtg tccttacctg aaaaatagag 300
gttagaggga tgctatgtgc cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt 360
tgtgagcaat tggaggtgag ggtggagccc agtgcccagc acctatgcac tggggaccca 420
aaaaggagca tcttctcatg attttatgta tcagaaattg ggatggcatg tcattgggac 480
agcgtctttt ttcttgtatg gtggcacata aatacatgtg tcttataatt aatggtattt 540
tagatttgac gaaatatgga atattacctg ttgtgctgat c 581

<210> 203
<211> 481
<212> DNA
<213> Homo sapiens



<400> 203 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60
ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120
gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180
ccacaaaagg ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac 240
cagctcagtg aatctgggcg agtaacacaa aacttgagtg tccttacctg aaaaatagag 300
gttagaggga tgctatgtgc cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt 360
tgtgagcaat tggaggtgag ggtggagccc agtgcccagc acctatgcac tggggaccca 420
aaaaggagca tcttctcatg attttatgta tcagaaattg ggatggcatg tcattgggac 480
a 481

<210> 204 
<211> 381 
<212> DNA

<213> Homo sapiens 

<400> 204 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60 

ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120

gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180 

ccacaaaagg ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac 240 

cagctcagtg aatctgggcg agtaacacaa aacttgagtg tccttacctg aaaaatagag 300 

gttagaggga tgctatgtgc cattgtgtgt gtgtgttggg ggtggggatt gggggtgatt 360 
tgtgagcaat tggaggtgag g 381

<210> 205 
<211> 281 
<212> DNA 
<213> Homo sapiens

<400> 205 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60 

ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120 

gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180
ccacaaaagg ggccaaatgg aagcagcaag acctgagttc aaattaaatc tgccaactac 240
cagctcagtg aatctgggcg agtaacacaa aacttgagtg t 281

<210> 206 
<211> 181

<212> DNA 

<213> Homo sapiens 
<400> 206 

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60
ctgcaggaca caggcaaagg tgctgagctg gaccctttat ttctgccctt ctcccttctg 120 
gcaccccggc caggaaattg ctgcagcctt tctggaatcc cgttcatttt tcttactggt 180 
c 181 

<210> 207

<211> 81 
<212> DNA 

<213> Homo sapiens 
<400> 207

gggggcatct ctggtgcagc ccggttcgga gcaggaagac gcttaataaa tgctgataga 60
ctgcaggaca caggcaaagg t 81 

<210> 208 
<211> 38

<212> DNA 

<213> Homo sapiens 
<400> 208 

gggggcatct ctggtgcagc ccggttcgga gcaggaag 38